Actions: santhnm2/Megatron-LM

Create PR to main with cherry-pick from release

129 workflow runs

[split 4/4] Enable DSA CP and THD hooks (#5246)
Create PR to main with cherry-pick from release #129: Commit da482cf pushed by santhnm2
9s main
Add RL rollout submission and consumption granularity controls (#5306)
Create PR to main with cherry-pick from release #128: Commit a58373f pushed by santhnm2
1s main
Add MIMO dual gradient finalization (colocated + non-colocated) (#5286)
Create PR to main with cherry-pick from release #127: Commit f8170b4 pushed by santhnm2
1s main
Disag MR1: Add inference shard specs and pg-collection building (#5186)
Create PR to main with cherry-pick from release #126: Commit fc4597c pushed by santhnm2
1s main
Add --mamba-training-ssm-states-dtype argument (#5309)
Create PR to main with cherry-pick from release #125: Commit 6142ee4 pushed by santhnm2
10s main
Add MIMO runtime setup: per-role RNG seeding and DDP wrapping (#5285)
Create PR to main with cherry-pick from release #124: Commit d1410e1 pushed by santhnm2
2s main
Profiling (#3110)
Create PR to main with cherry-pick from release #123: Commit a12484b pushed by santhnm2
6s main
Add full model cuda graph support for MTP inference (#4950)
Create PR to main with cherry-pick from release #122: Commit 1cfa834 pushed by santhnm2
1s main
Allow for pre-bound socket to be passed in server (#5301)
Create PR to main with cherry-pick from release #121: Commit df9141e pushed by santhnm2
9s main
Remove checkpoint-time GPU cache reclaim workaround (#5170)
Create PR to main with cherry-pick from release #120: Commit 3a183e2 pushed by santhnm2
1s main
chore: rotate oncall schedule
Create PR to main with cherry-pick from release #119: Commit 2065b5a pushed by santhnm2
1s main
chore(beep boop 🤖): Bump (main) (2026-06-08)
Create PR to main with cherry-pick from release #118: Commit dbf719b pushed by santhnm2
1s main
Restore Greptile configuration (#5166)
Create PR to main with cherry-pick from release #117: Commit 2944537 pushed by santhnm2
1s main
Add MTP acceptance rate metrics (#3458)
Create PR to main with cherry-pick from release #116: Commit d041544 pushed by santhnm2
2s main
Change the cudagraph distribution from linearly to exponentially-decr…
Create PR to main with cherry-pick from release #115: Commit 16b7194 pushed by santhnm2
2s main
Update oncall reviewer assignment (#5093)
Create PR to main with cherry-pick from release #114: Commit f5da5ea pushed by santhnm2
1s main
fix mimo optimizer checkpoint metadata restore (#4791)
Create PR to main with cherry-pick from release #113: Commit 80cf756 pushed by santhnm2
1s main
[fix] Release MTP assertion when EP overlap with PP=1 (#4796)
Create PR to main with cherry-pick from release #112: Commit 286445c pushed by santhnm2
2s main
ci: Add allow_failure flag to gpt and moe recipes that are failing in…
Create PR to main with cherry-pick from release #111: Commit 859b719 pushed by santhnm2
1s main
Perf tests (#4917)
Create PR to main with cherry-pick from release #110: Commit 686aa8c pushed by santhnm2
2s main
chore: rotate oncall schedule
Create PR to main with cherry-pick from release #109: Commit 38986a9 pushed by santhnm2
4s main
Update golden values for nightly functional tests (#4850)
Create PR to main with cherry-pick from release #108: Commit b2a8ec7 pushed by santhnm2
1s main
Inference: Optimize Prefill Engine Steps for Nemotron (#4764)
Create PR to main with cherry-pick from release #107: Commit 9b4074b pushed by santhnm2
1s main
Update transformer-engine dependency to version 2.15.0 (#4682)
Create PR to main with cherry-pick from release #106: Commit 815c83d pushed by santhnm2
1s main
build(deps): bump nvidia-modelopt to 0.43 (#4723)
Create PR to main with cherry-pick from release #105: Commit 434368c pushed by santhnm2
2s main