Open
Conversation
…t-as-multiturn - utils_chat.py: doc_to_text_fewshot, doc_to_text (eval with 'Think step by step.' at end), fewshot_doc_to_target (no 'Answer:' in assistant) - _mmlu_pro_chat_template_yaml: base template with chat-friendly fewshot split - 14 subject YAMLs (mmlu_pro_chat_biology through mmlu_pro_chat_psychology) - _mmlu_pro_chat.yaml: group aggregating all subjects
…utils_chat.py for clearer instructions
…ation - Introduced a new group `mmlu_pro_chat` for fewshot examples formatted for multi-turn chat. - Listed corresponding multi-turn chat tasks for various subjects in the mmlu_pro dataset.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR implements a variant of MMLU-PRO (mmlu_pro_chat) formatted for proper multi-turn chat (user/assistant per example). Use with
--apply_chat_templateand--fewshot_as_multiturnfor chat models; works as plain-text when those flags are not used.'