Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: better error handling and message in refit
#1477 opened Nov 6, 2025 by ZhiyuLi-Nvidia Loading…
4 tasks
feat: fp16 for DTensor policies
#1474 opened Nov 5, 2025 by adil-a Loading…
Mmanohara/merge grpo helpsteer cp tp community-request
#1472 opened Nov 4, 2025 by nv-mmanohara Loading…
4 tasks
feat: DTensorPolicyV2 GPT-OSS support CI:L0 Run doctests and unit tests
#1470 opened Nov 4, 2025 by adil-a Loading…
build: Ensure automodel has deepep and TE
#1456 opened Oct 31, 2025 by chtruong814 Loading…
4 tasks
feat: Add GPT-OSS support via mcore
#1452 opened Oct 31, 2025 by ashors1 Draft
4 tasks
feat: Integrate Penguin env logic CI:L0 Run doctests and unit tests
#1450 opened Oct 31, 2025 by bxyu-nvidia Loading…
4 tasks
feat: Fp8 moe rollout
#1446 opened Oct 29, 2025 by guyueh1 Loading…
4 tasks
fix: Fix process_weights_after_loading for fp8 dense CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1432 opened Oct 27, 2025 by guyueh1 Loading…
4 tasks
feat: enhance advantages tracking and normalization stability in GRPO CI:L1 Run doctests, unit tests, and functional tests r0.4.0
#1423 opened Oct 24, 2025 by ffrujeri Loading…
fix: add theoretical TFlops for H200 GPU CI:L0 Run doctests and unit tests
#1422 opened Oct 24, 2025 by roclark Loading…
4 tasks done
feat: Output buffer cache in megatron->hf generator
#1417 opened Oct 23, 2025 by guyueh1 Loading…
4 tasks
DRAFT: feat: Enable simulated user for multi-turn GRPO
#1412 opened Oct 22, 2025 by ahmadki Loading…
4 tasks
fix: Make the optimizer offloading optional CI:L1 Run doctests, unit tests, and functional tests Performance Related to improving performance
#1404 opened Oct 22, 2025 by youngeunkwon0405 Loading…
4 tasks
feat: Add support for IPO and RPO algorithm community-request documentation Improvements or additions to documentation
#1388 opened Oct 17, 2025 by sanjana-inflection Loading…
1 of 4 tasks
feat: additional validation losses for preference data documentation Improvements or additions to documentation
#1367 opened Oct 15, 2025 by jveronvialard Draft
4 tasks
feat: GSPO-token
#1357 opened Oct 14, 2025 by pjin-nvidia Draft
4 tasks
fix: Use custom chat template also in VLM's
#1344 opened Oct 11, 2025 by jseppanen Loading…
docs: Refactor Home Page and New About Section documentation Improvements or additions to documentation
#1338 opened Oct 10, 2025 by jgerh Loading…
feat: Onboard perf recipes in tests
#1322 opened Oct 8, 2025 by guyueh1 Loading…
4 tasks
feat: [Draft Do Not merge] Kitchen interface
#1310 opened Oct 8, 2025 by guyueh1 Draft
4 tasks
ProTip! Updated in the last three days: updated:>2025-11-03.