⛔ Add EOS token to processed input in SFT #3091

qgallouedec · 2025-03-14T21:15:23Z

Learn to generate EOS.

kashif

for the gemma3 generation issue?

HuggingFaceDocBuilderDev · 2025-03-14T21:20:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

qgallouedec · 2025-03-14T21:25:05Z

No, actually there have been recurrent reports that SFT can't learn to generate EOS. I'm pretty sure #2405 re-introduced the bug reported in #1623

skandermoalla · 2025-03-15T08:25:34Z

@qgallouedec I've faced this multiple times. I think it's just because of the (not so good) practice in the examples everywhere setting the pad token to the eos token. Then the SFT preprocessing masks everything that's a pad token (=eos token), including the real eos token in the chat template.

skandermoalla · 2025-03-15T08:29:04Z

Personally, I don't think these forced patches are a good design. I understand that you want the Trainers to work out of the box, but users should still make sure they have a chat template that adds an eos properly. In case someone doesn't want an eos they can't do that anymore now.
(Same for the DPOTrainer btw, I think it adds an extra eos token somewhere.)

HwangYej1 · 2025-03-26T08:55:53Z

i got this TypeError: 'Qwen2TokenizerFast' object is not subscriptabl, after change this code

Add EOS token to processed input

661043f

qgallouedec mentioned this pull request Mar 14, 2025

Learning to generate EOS tokens #1623

Closed

qgallouedec requested review from shirinyamani, kashif, edbeeching and lewtun March 14, 2025 21:17

qgallouedec changed the title ~~Add EOS token to processed input in SFT~~ ⛔ Add EOS token to processed input in SFT Mar 14, 2025

kashif approved these changes Mar 14, 2025

View reviewed changes

qgallouedec and others added 2 commits March 14, 2025 15:26

Update sft_trainer.py

ecc9326

fix test

7aef131

qgallouedec merged commit 5cb390c into main Mar 15, 2025
14 checks passed

qgallouedec deleted the fix-eos-sft branch March 15, 2025 01:06

qgallouedec mentioned this pull request Mar 19, 2025

After SFT, the model make repetitions huggingface/open-r1#520

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

⛔ Add EOS token to processed input in SFT #3091

⛔ Add EOS token to processed input in SFT #3091

qgallouedec commented Mar 14, 2025

kashif left a comment

HuggingFaceDocBuilderDev commented Mar 14, 2025

qgallouedec commented Mar 14, 2025 •

edited

Loading

skandermoalla commented Mar 15, 2025 •

edited

Loading

skandermoalla commented Mar 15, 2025

HwangYej1 commented Mar 26, 2025

⛔ Add EOS token to processed input in SFT #3091

⛔ Add EOS token to processed input in SFT #3091

Conversation

qgallouedec commented Mar 14, 2025

kashif left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Mar 14, 2025

qgallouedec commented Mar 14, 2025 • edited Loading

skandermoalla commented Mar 15, 2025 • edited Loading

skandermoalla commented Mar 15, 2025

HwangYej1 commented Mar 26, 2025

qgallouedec commented Mar 14, 2025 •

edited

Loading

skandermoalla commented Mar 15, 2025 •

edited

Loading