[Benchmark Backfill] Integrate OmniDocBench into lmms-eval #1152

Merged: Luodian merged 1 commit into dev-v0d7 from feat/lmm-282-omnidocbench on Feb 22, 2026
@Luodian (Contributor) commented Feb 22, 2026

Summary

  • Add the omnidocbench benchmark task, with its YAML config and utility functions.
  • Add omnidocbench to the document-understanding benchmark list in docs/current_tasks.md.
  • Point the task at a loadable TSV mirror of the OmniDocBench dataset so that it runs reliably.

Validation

  • uv run python -m lmms_eval --model openai --model_args model_version=bytedance-seed/seed-1.6-flash,api_key=$OPENROUTER_API_KEY,base_url=https://openrouter.ai/api/v1 --tasks omnidocbench --batch_size 1 --limit 1
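
For repeated smoke tests, the validation command above could be wrapped in a small helper. The following is only a sketch: the function names build_model_args and smoke_test and the DRY_RUN variable are hypothetical and not part of lmms-eval. The flags, model version, and base URL are copied from the command above.

```shell
# Sketch only: hypothetical helpers around the validation command from this PR.

# Join the three model arguments into the comma-separated --model_args value.
build_model_args() {
  printf 'model_version=%s,api_key=%s,base_url=%s' "$1" "$2" "$3"
}

# Run the one-sample smoke test for a task (default: omnidocbench).
# With DRY_RUN=1 the command is printed instead of executed.
# Note: the printed command includes the API key, so avoid logging it.
smoke_test() {
  task="${1:-omnidocbench}"
  if [ -z "${OPENROUTER_API_KEY:-}" ]; then
    echo "OPENROUTER_API_KEY is not set" >&2
    return 1
  fi
  model_args="$(build_model_args "bytedance-seed/seed-1.6-flash" \
    "$OPENROUTER_API_KEY" "https://openrouter.ai/api/v1")"
  # Reuse the positional parameters as the command's argument vector.
  set -- uv run python -m lmms_eval --model openai \
    --model_args "$model_args" --tasks "$task" --batch_size 1 --limit 1
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}
```

Calling smoke_test with DRY_RUN=1 prints the assembled command without contacting the API, which makes it easy to check the arguments before spending quota. Failing early on a missing OPENROUTER_API_KEY avoids a less obvious authentication error later.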

@Luodian Luodian changed the base branch from main to dev-v0d7 February 22, 2026 11:23
@Luodian Luodian force-pushed the feat/lmm-282-omnidocbench branch from ff14627 to 5d5d419 Compare February 22, 2026 11:25
@Luodian Luodian merged commit 44c4f12 into dev-v0d7 Feb 22, 2026
2 checks passed
@Luodian Luodian deleted the feat/lmm-282-omnidocbench branch February 23, 2026 08:24