[FIX] Drop only the sample truncated instead of the entire group #744

felipemello1 · 2026-01-29T17:03:15Z

I have personally reviewed this PR and description before asking others to do so. It meets the quality bar I expect from others. I understand that if this PR is perceived as unverified AI-generated code, it will be closed without further explanation.
I have run tests and confirmed that this code works

Description

We want to drop a sample that is truncated, since we cannot properly compute rewards for it
Previously we were dropping all episodes in a group if any episode was truncated
This eventually caused the buffer to never have episodes available

Fix: instead of dropping all samples, we just set the adv of truncated samples to 0.
You may ask: why not just drop them?
Because if our bsz=8, and we drop 1, now we have only 7 samples, and the trainer may need to wait until the next batch to train on the previous one, but at this point, the replay buffer may evict older policies, we would have a mess

Test plan

fix drop truncated

8b463ec

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 29, 2026

set loss mask to 0

7e643e7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FIX] Drop only the sample truncated instead of the entire group #744

[FIX] Drop only the sample truncated instead of the entire group #744

Uh oh!

felipemello1 commented Jan 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[FIX] Drop only the sample truncated instead of the entire group #744

Are you sure you want to change the base?

[FIX] Drop only the sample truncated instead of the entire group #744

Uh oh!

Conversation

felipemello1 commented Jan 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

felipemello1 commented Jan 29, 2026 •

edited

Loading