Agentic GRPO improvements: sampler-IS correction, eval fix, flash att…

49b63f7
Open

Agentic GRPO: TIS correction, eval dedup, flash-attn segment_ids #1523

Google CLA / cla/google succeeded May 17, 2026 in 6s

✅ All contributors are covered under a CLA with Google

See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).

ℹ️ Googlers: Go here to view more details and manage scans for this pull request.

Details

The following contributors were found for this pull request:

49b63f7 Author: @colincai-mc <co*****ai@modelcorp.ai>

(Only the first commit for a unique contributor is listed.)