MolmoSpaces Benchmark #3104

@nkalyanv99

Ticket Type

💡 Feature Request / Improvement

Environment & System Info

Description

MolmoSpaces is a new, broad robotics evaluation benchmark from Allen AI.
It would be great to add it to our supported evaluations.

Context & Reproduction

No response

Relevant logs or stack trace

Checklist

  • I have searched existing tickets to ensure this isn't a duplicate.
  • I am using the latest version of the main branch.
  • I have verified this is not an environment-specific problem.

Additional Info / Workarounds

No response

Metadata

Labels

  • enhancement: Suggestions for new features or improvements
  • evaluation: For issues or PRs related to environment evaluation, and benchmarks.
