IndustrialCoder / .eval_results /swe_bench_verified.yaml
zwpride's picture
Add SWE-Bench Verified evaluation result (#4)
536ba75
raw
history blame contribute delete
216 Bytes
- dataset:
id: SWE-bench/SWE-bench_Verified
task_id: swe_bench_%_resolved
value: 74.8
source:
url: https://huggingface.co/papers/2603.16790
name: IndustrialCoder technical report
user: nielsr