Skip to content

add super_gpqa task#843

Merged
Luodian merged 1 commit into
mainfrom
dev/new-task
Oct 3, 2025
Merged

add super_gpqa task#843
Luodian merged 1 commit into
mainfrom
dev/new-task

Conversation

@pbcong
Copy link
Copy Markdown
Collaborator

@pbcong pbcong commented Sep 29, 2025

lmms-eval test result:
Qwen__Qwen2.5-7B-Instruct

Tasks Version Filter n-shot Metric Value Stderr
super_gpqa 0 none 0 accuracy 0.2924 ± 0.0028

MegaScience reported score: 28.78

@Luodian Luodian merged commit 36dcfbb into main Oct 3, 2025
1 of 2 checks passed
Luodian pushed a commit that referenced this pull request Feb 28, 2026
stisiTT pushed a commit to bgoelTT/lmms-eval that referenced this pull request Mar 6, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants