
[Bug]: swe_bench/eval_infer.py type checking issues #6336

Open
1 task done
neubig opened this issue Jan 17, 2025 · 2 comments
Labels
bug Something isn't working evaluation Related to running evaluations with OpenHands fix-me Attempt to fix this issue with OpenHands

Comments

@neubig
Contributor

neubig commented Jan 17, 2025

Is there an existing issue for the same bug?

  • I have checked the existing issues.

Describe the bug and reproduction steps

There are several places in eval_infer.py for swe_bench that would not pass mypy type checks.

For instance in this line:
https://github.com/All-Hands-AI/OpenHands/blob/main/evaluation/benchmarks/swe_bench/eval_infer.py#L94

The type of `metadata` is `EvalMetadata | None`, but the `None` case is never handled before the value is used. We should run basic linting and type-safety checks to tighten up the code and reduce unexpected errors.

OpenHands Installation

Docker command in README

OpenHands Version

No response

Operating System

None

Logs, Errors, Screenshots, and Additional Context

No response

@neubig neubig added bug Something isn't working fix-me Attempt to fix this issue with OpenHands labels Jan 17, 2025
@openhands-agent
Contributor

OpenHands started fixing the issue! You can monitor the progress here.

@openhands-agent
Contributor

An attempt was made to automatically fix this issue, but it was unsuccessful. A branch named 'openhands-fix-issue-6336' has been created with the attempted changes. You can view the branch here. Manual intervention may be required.

Additional details about the failure:
The issue has not been successfully resolved based on the evidence provided. The AI agent attempted to run mypy type checking on the specified files (eval_infer.py and mapping.py), but the command failed with exit code 1, indicating type checking errors are still present.

The original issue pointed out type safety problems in eval_infer.py, specifically that the metadata variable's potential None value was not being properly handled. The mypy failure suggests these type safety issues have not been fixed yet, as the type checker is still detecting problems. No actual code changes were shown in the provided thread that would address the identified typing issues.

To resolve this issue, code changes would need to be made to properly handle the None case for the metadata variable and any other type safety issues detected by mypy. Since no such changes are evident and the type checker is still failing, the bug remains unresolved.

@mamoodi mamoodi added the evaluation Related to running evaluations with OpenHands label Jan 18, 2025