Skip to content

Pull requests: swiss-ai/Megatron-LM-MoE

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add MLA split support to MDDecoupling
#6 opened Jul 1, 2026 by andresnowak Collaborator Loading…
Apertus moe/experiment framework and scaling ladder
#5 opened Jul 1, 2026 by haeggee Collaborator Loading…
Apertus moe/quantile balancing
#3 opened Jun 27, 2026 by andresnowak Collaborator Loading…
MDDecoupling optimizer support for offloading expert
#1 opened Jun 26, 2026 by FFGGSSJJ Collaborator Loading…
ProTip! What’s not been updated in a month: updated:<2026-06-02.