[Benchmark] Support OmniBench by jmlee4967 · Pull Request #1327 · open-compass/VLMEvalKit

jmlee4967 · 2025-11-26T03:32:26Z

Multimodal models are increasingly expanding beyond the traditional vision-text modalities to include audio. In addition to Qwen2.5-Omni, which is already supported in VLMEvalKit, the recently released Qwen3-Omni model also handles audio modalities alongside vision (video) and text.
It would be great to see VLMEvalKit expand to support audio modalities when evaluating such models.

This PR adds support for the OmniBench dataset. OmniBench is a benchmark in which a model must jointly interpret image and audio context and answer questions posed in text form.

OmniBench has been widely used for evaluating omni-modality performance in various papers, including the Qwen2.5-Omni paper.

support omnibench

62ec64e

jmlee4967 changed the title ~~support OmniBench~~ [Benchmark] support OmniBench Nov 26, 2025

jmlee4967 changed the title ~~[Benchmark] support OmniBench~~ [Benchmark] Support OmniBench Nov 26, 2025

mzr1996 requested a review from kennymckormick November 28, 2025 10:53

mzr1996 approved these changes Nov 28, 2025

jmlee4967 wants to merge 1 commit into open-compass:main from jmlee4967:omnibench
