Fix correctness issues in Arabic normalization and prompt loading by RinZ27 · Pull Request #3589 · EleutherAI/lm-evaluation-harness

RinZ27 · 2026-02-15T07:17:35Z

Several correctness issues were identified during a deep dive into the codebase, specifically affecting Arabic normalization, prompt loading, and logging hygiene.

Key changes:

Corrected the Arabic definite article removal regex in mlqa/utils.py. The previous regex had a misplaced caret and was overly aggressive, which could lead to corrupted word forms.
Added an else block in lm_eval/prompts/__init__.py to provide a clearer error message when an unknown prompt category is used, preventing a potential UnboundLocalError.
Removed a debug print(prompt) statement in med_prescriptions/utils.py to keep evaluation logs clean and protect potential PII in medical datasets.
Cleaned up redundant variable assignments (e.g., x = x) in the ruler task module to improve code clarity.

Verified the fixes by running the test_utils.py suite and confirmed everything passes correctly. These improvements directly benefit evaluation accuracy and project robustness.

CLAassistant · 2026-02-15T07:17:41Z

All committers have signed the CLA.

Fix correctness issues in Arabic normalization and prompt loading

cb0657a

RinZ27 requested a review from baberabb as a code owner February 15, 2026 07:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix correctness issues in Arabic normalization and prompt loading#3589

Fix correctness issues in Arabic normalization and prompt loading#3589
RinZ27 wants to merge 1 commit intoEleutherAI:mainfrom
RinZ27:fix-correctness-and-leaks

RinZ27 commented Feb 15, 2026

Uh oh!

CLAassistant commented Feb 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

RinZ27 commented Feb 15, 2026

Uh oh!

CLAassistant commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CLAassistant commented Feb 15, 2026 •

edited

Loading