Skip to content

MMLU PRO chat variant#3568

Open
anmarques wants to merge 7 commits intoEleutherAI:mainfrom
neuralmagic:mmlu-pro-chat-variant
Open

MMLU PRO chat variant#3568
anmarques wants to merge 7 commits intoEleutherAI:mainfrom
neuralmagic:mmlu-pro-chat-variant

Conversation

@anmarques
Copy link
Contributor

This PR implements a variant of MMLU-PRO (mmlu_pro_chat) formatted for proper multi-turn chat (user/assistant per example). Use with --apply_chat_template and --fewshot_as_multiturn for chat models; works as plain-text when those flags are not used.'

…t-as-multiturn

- utils_chat.py: doc_to_text_fewshot, doc_to_text (eval with 'Think step by step.' at end), fewshot_doc_to_target (no 'Answer:' in assistant)
- _mmlu_pro_chat_template_yaml: base template with chat-friendly fewshot split
- 14 subject YAMLs (mmlu_pro_chat_biology through mmlu_pro_chat_psychology)
- _mmlu_pro_chat.yaml: group aggregating all subjects
…ation

- Introduced a new group `mmlu_pro_chat` for fewshot examples formatted for multi-turn chat.
- Listed corresponding multi-turn chat tasks for various subjects in the mmlu_pro dataset.
@anmarques anmarques requested a review from baberabb as a code owner February 6, 2026 21:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant