Reduce inconsistency across trainer test files by qgallouedec · Pull Request #5678 · huggingface/trl

qgallouedec · 2026-04-29T01:33:42Z

What changed

- Renamed `test_training_*` → `test_train_*` (stragglers in DPO, GRPO, RLOO, and 10 experimental files)
- Aligned wording: `parameters` → `params`, `Check the` → `Check that the`
- Removed redundant section comments that describe WHAT instead of WHY (`# Get the dataset`, `# Initialize the trainer`, `# Train the model`, etc.; per CLAUDE.md guidance)
- Fixed within-file outliers (period style, `torch.equal` vs `torch.allclose`, missing inline comments on shared config args)
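The `torch.equal` vs `torch.allclose` outlier matters when the assertion is negated to mean "this param changed". The snippet below is a minimal sketch with made-up tensor values, not code taken from the PR: it shows how a small but real update is seen by an exact comparison and missed by a tolerance-based one (using the default `rtol=1e-05` and `atol=1e-08` of `torch.allclose`).

```python
import torch

# Hypothetical values standing in for a parameter before and after training.
param = torch.tensor([1.0, 2.0])
new_param = param + 1e-6  # a small but real update

# Exact comparison: any differing element counts as a change.
changed_exact = not torch.equal(param, new_param)

# Tolerance-based comparison: the update falls inside the default
# tolerance, so the tensors are reported as "close" and the change is missed.
changed_tolerant = not torch.allclose(param, new_param)

print(changed_exact, changed_tolerant)
```

Under these assumptions, a "param has changed" assertion written with `not torch.allclose(...)` could fail on a very small update, while `not torch.equal(...)` would pass.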

Note

Low Risk
Test-only refactors (renames, comment cleanup, and assertion consistency) with minor dataset-loading adjustments; low risk outside of potentially changing which splits are exercised in a few tests.

Overview
Improves consistency across trainer test suites by renaming remaining test_training* methods to test_train*, tightening/standardizing assertion messaging, and removing redundant narrative comments.

Also aligns dataset usage in several tests (e.g., consistently using load_dataset(..., split="train") where a single-split dataset is expected) and standardizes parameter-change checks to use torch.equal (and params wording) for clearer, uniform test behavior.
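As a rough illustration of the standardized parameter-change check, the sketch below snapshots the params, runs one update, and asserts with `torch.equal` that each param changed. The tiny `torch.nn.Linear` model, the single SGD step (standing in for `trainer.train()`), and the helper name `assert_params_changed` are assumptions made for this example, not code from the PR.

```python
import torch


def assert_params_changed(previous_params, model):
    """Check that the params have changed (hypothetical helper)."""
    for name, param in previous_params.items():
        new_param = model.get_parameter(name)
        assert not torch.equal(param, new_param), f"Param {name} has not changed"


torch.manual_seed(0)
model = torch.nn.Linear(4, 2)  # stand-in for the trainer's model

# Snapshot the params before the update; clone so later in-place
# optimizer updates do not alter the saved copies.
previous_params = {n: p.detach().clone() for n, p in model.named_parameters()}

# One SGD step on random inputs, standing in for a training run.
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss = model(torch.randn(8, 4)).pow(2).mean()
loss.backward()
optimizer.step()

assert_params_changed(previous_params, model)
```

Cloning the snapshot is the important detail: comparing against the live parameter objects would always report "no change", since both names would refer to the same tensors.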

Reviewed by Cursor Bugbot for commit c06fb06. Bugbot is set up for automated code reviews on this repo.

HuggingFaceDocBuilderDev · 2026-04-29T01:36:25Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

- Updated assertions in `test_grpo_trainer.py`, `test_reward_trainer.py`, `test_rloo_trainer.py`, and `test_sft_trainer.py` to use `torch.allclose` instead of `torch.equal` for better numerical stability when checking if parameters have changed.
- Ensured consistency in assertion messages across all modified tests.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

There are 2 total unresolved issues (including 1 from previous review).

Reviewed by Cursor Bugbot for commit c5a2467.

Reduce inconsistency across trainer test files

7702130

qgallouedec requested a review from albertvillanova April 29, 2026 01:34

qgallouedec assigned kashif Apr 29, 2026

qgallouedec requested a review from AmineDiro April 29, 2026 01:34

qgallouedec unassigned kashif Apr 29, 2026

qgallouedec requested a review from kashif April 29, 2026 13:17

Merge branch 'main' into less-inconsistency

ef0d793

cursor Bot reviewed Apr 29, 2026

Comment thread tests/test_dpo_trainer.py Outdated

cursor Bot reviewed Apr 29, 2026

Comment thread tests/test_grpo_trainer.py Outdated

use split when possible

c5a2467

cursor Bot reviewed Apr 29, 2026

Comment thread tests/test_callbacks.py Outdated

qgallouedec and others added 8 commits April 29, 2026 15:47

`not torch.allclose(param, new_param[, ...])` to `not torch.equal(param, new_param)`

639adc2

dummy_dataset -> dataset

8951719

revert

b886cc2

Merge branch 'main' into less-inconsistency

5728523

Merge branch 'main' into less-inconsistency

f4fdce5

Merge branch 'main' into less-inconsistency

22943c6

Merge branch 'main' into less-inconsistency

a3b720e

Merge branch 'main' into less-inconsistency

c06fb06

qgallouedec wants to merge 12 commits into main from less-inconsistency

3 participants
