[Benchmark] Support OmniBench #1327

Open
jmlee4967 wants to merge 1 commit into open-compass:main from jmlee4967:omnibench

@jmlee4967
Multimodal models are expanding beyond the traditional vision-text pairing to include audio. Qwen2.5-Omni, which VLMEvalKit already supports, and the recently released Qwen3-Omni both handle audio alongside vision (video) and text.
It would be great for VLMEvalKit to cover the audio modality when evaluating such models.

This PR adds support for the OmniBench dataset. OmniBench is a benchmark in which a model must jointly analyze an image and an audio context and answer a question in text form.

OmniBench is widely used to evaluate omni-modality performance in published work, including the Qwen2.5-Omni paper.
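
As a rough illustration of the task format described above (not the code in this PR), one image-audio-text QA item and an exact-match accuracy score could be sketched as follows. The `OmniSample` class, its field names, and the `accuracy` helper are hypothetical; they are not taken from VLMEvalKit or from the OmniBench release.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class OmniSample:
    """Hypothetical record: one image, one audio clip, one text question."""
    image_path: str
    audio_path: str
    question: str
    answer: str  # ground-truth answer in text form


def accuracy(samples: List[OmniSample], predictions: List[str]) -> float:
    """Fraction of predictions matching the ground truth (case-insensitive)."""
    if len(samples) != len(predictions):
        raise ValueError("samples and predictions must have the same length")
    if not samples:
        return 0.0
    correct = sum(
        pred.strip().lower() == sample.answer.strip().lower()
        for sample, pred in zip(samples, predictions)
    )
    return correct / len(samples)
```

Exact matching is the simplest possible scorer and is assumed here only for illustration; the actual evaluation logic is whatever the PR's dataset class implements.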

@jmlee4967 jmlee4967 changed the title support OmniBench [Benchmark] support OmniBench Nov 26, 2025
@jmlee4967 jmlee4967 changed the title [Benchmark] support OmniBench [Benchmark] Support OmniBench Nov 26, 2025