-
Notifications
You must be signed in to change notification settings - Fork 3.5k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Remove cross-rank synchronization during checkpoint load & deprecate torch.distributed.checkpoint.state_dict_loader.load_state_dict
#2864
opened Jan 8, 2026 by
asolergi-nv
Loading…
Cherry-pick bug fixes into 0.15.X.
cherry-pick
core_r0.15.0
#2858
opened Jan 7, 2026 by
cspades
Loading…
6 tasks
Use global user buffer when the bucket size does not fit FixedPoolAllocator
#2857
opened Jan 7, 2026 by
shengf-nv
Loading…
6 tasks
[Dev] Add Qwen3-VL support with Megatron-FSDP
dev branch
Dev branch related issues and development
#2842
opened Jan 7, 2026 by
xuwchen
Loading…
6 tasks
Refactor spec modification/introspection to make references to Submodules typed
community-request
#2834
opened Jan 6, 2026 by
nschank
Loading…
6 tasks
fsdp: avoid double sharding of MoE experts when EP is enabled
community-request
#2833
opened Jan 6, 2026 by
CodersAcademy006
Loading…
Fix: Skip JIT warmup when fusion is disabled via arguments
community-request
#2827
opened Jan 6, 2026 by
kisseternity
Loading…
Add type hints to z_loss_func parameters
community-request
#2821
opened Jan 6, 2026 by
JavaZeroo
Loading…
2 of 6 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.