Skip to content

Conversation

ddupont808
Copy link
Contributor

This PR adds support for run_offline_dataset evals, and adds a cell to the hud_hackathon.ipynb notebook with steps on running an offline OSWorld benchmarking (using MMInstruction/OSWorld-G)

@ddupont808 ddupont808 marked this pull request as ready for review September 11, 2025 19:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant