Pull requests: NVIDIA/Megatron-LM

Pull requests list

docs: add spectral descent / muon article to docs/discussions
Labels: docs-only
#3809 opened Mar 11, 2026 by sbhavani
1 of 5 tasks
Fix backward compatibility issue with MFSDP --grad-reduce-in-bf16
Labels: complexity: low, Expert Review, module: megatron-fsdp
#3799 opened Mar 11, 2026 by shjwudp
5 tasks
Core 0.16
Exposing interleave argument for fused_apply_rotary_pos_emb_thd
Labels: complexity: low, Final Review
#3794 opened Mar 11, 2026 by huvunvidia
5 tasks
Core 0.16
docs: Update Latest News in README.md
Labels: docs-only
#3790 opened Mar 10, 2026 by sbhavani
1 of 5 tasks
Fix bug in EP sync for Mamba models
#3785 opened Mar 10, 2026 by santhnm2 Draft
5 tasks
Shanmugamr1992/megatron inference ultra
Labels: Approved, complexity: medium
#3784 opened Mar 10, 2026 by shanmugamr1992
5 tasks
Core 0.16
Custom step batch size rampup schedules
Labels: complexity: medium, Final Review
#3779 opened Mar 10, 2026 by mkhona-nvidia
5 tasks
Do not let chunked prefill generate decode tokens
Labels: complexity: low, Final Review
#3777 opened Mar 10, 2026 by tdene
5 tasks
Core 0.16
Track errors through the inference return path
Labels: Final Review
#3776 opened Mar 10, 2026 by tdene
5 tasks
Core 0.16