Pull requests: EleutherAI/lm-evaluation-harness
feat: Add hf-mistral3 adapter for Ministral-3 models
#3487 opened Jan 5, 2026 by medhakimbedhief
Implement new translation tasks for google WMT24++ datasets
#3480 opened Dec 25, 2025 by grzegorz-aniol
Fix MGSM stop criteria in Iberian languages to exclude line breaks
#3465 opened Dec 16, 2025 by juliafalcao
Rename every bigbench .yaml to be identified using bigbench as task
#3459 opened Dec 10, 2025 by MigueXl
Fix wrong gpqa_diamond_generative_n_shot answer template
#3407 opened Nov 15, 2025 by fxmarty-amd
Fix: Prevent infinite loop when max_seq_lengths < 4096 in prepare_niah.py
#3372 opened Oct 28, 2025 by vnayakde
Add support for configurable chrF metric parameters in task YAML, fix…
#3363 opened Oct 23, 2025 by augustlakia
[AIME24 | AIME25] Enable Multiple Generation Repeats with Pass@k and Majority@k Metrics
#3351 opened Oct 17, 2025 by ihebchaa
feat: Add support for accelerate-wrapped models in simple_evaluate()
#3313 opened Sep 26, 2025 by DhruvaKashyap