**What problem or use case are you trying to solve?**

Right now the `evaluation` directory structure is very flat, and it is hard to tell which subdirectories are utilities for implementing benchmarks or running basic tests for OpenHands (`utils`, `integration_tests`, `regression`, `static`), and which are actual benchmarks from the ML literature (everything else).

To make this clearer, we can move all benchmarks to live under an `evaluation/benchmarks/` directory. All other files related to evaluation (documentation, GitHub workflows, etc.) will then need to be checked and updated to stay consistent.

While we do this, we can also add the benchmarks that are currently missing from the `evaluation/README.md` documentation.