You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Very wonderful work.
I notice that swe-bench evaluation requires files including
eval.sh: The evaluation script
patch.diff: The model's generated prediction
report.json: Summary of evaluation outcomes for this instance
run_instance.log: A log of SWE-bench evaluation steps
test_output.txt: An output of running eval.sh on patch.diff
And in auto code rover we only get the json and patch.diff
how can we get test_output.txt?
Thanks a lot!
The text was updated successfully, but these errors were encountered:
27yw
changed the title
How can the result form into the one for SWE-bench evaluation?
How can the result be reformed into the one for SWE-bench evaluation?
Nov 11, 2024
Hi! You would need to first transform the json into jsonl (with a simple python script for example), then evaluate the jsonl with SWE-bench's containerized evaluation. Then in SWE-bench/logs/ you will find these files.
Very wonderful work.
I notice that swe-bench evaluation requires files including
And in auto code rover we only get the json and patch.diff
how can we get test_output.txt?
Thanks a lot!
The text was updated successfully, but these errors were encountered: