Skip to content

Actions: deepspeedai/DeepSpeed

nv-lightning-v100

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,361 workflow runs
4,361 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Avoid graph break by removing redundant requires_grad attr change
nv-lightning-v100 #14888: Pull request #7158 synchronize by hwchen2017
March 22, 2025 16:32 4m 29s deepcharm:master
March 22, 2025 16:32 4m 29s
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-lightning-v100 #14887: Pull request #7163 synchronize by xiongjyu
March 22, 2025 06:52 Action required xiongjyu:master
March 22, 2025 06:52 Action required
Fix pre-compile on cpu-only machines
nv-lightning-v100 #14886: Pull request #7168 opened by AlongWY
March 22, 2025 06:23 4m 26s AlongWY:patch-1
March 22, 2025 06:23 4m 26s
nv-lightning-v100
nv-lightning-v100 #14885: Scheduled
March 22, 2025 00:21 39m 57s master
March 22, 2025 00:21 39m 57s
DeepCompile for enhanced compiler integration
nv-lightning-v100 #14884: Pull request #7154 synchronize by tohtana
March 21, 2025 22:07 4m 32s tohtana/deepcompile
March 21, 2025 22:07 4m 32s
Link AutoTP blog in the front page
nv-lightning-v100 #14883: Pull request #7167 opened by hwchen2017
March 21, 2025 22:01 4m 35s hongwei_link_autotp_blog
March 21, 2025 22:01 4m 35s
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-lightning-v100 #14882: Pull request #7166 synchronize by mauryaavinash95
March 21, 2025 19:14 4m 34s DataStates:dev
March 21, 2025 19:14 4m 34s
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-lightning-v100 #14881: Pull request #7166 opened by mauryaavinash95
March 21, 2025 19:14 Action required DataStates:dev
March 21, 2025 19:14 Action required
Add destroy to tests to free memory
nv-lightning-v100 #14880: Pull request #7160 synchronize by tohtana
March 21, 2025 15:51 4m 25s tohtana/destroy_model_test_zero
March 21, 2025 15:51 4m 25s
async tp allreduce
nv-lightning-v100 #14879: Pull request #7115 synchronize by tjruwase
March 21, 2025 11:25 14m 38s inkcherry:async_tp
March 21, 2025 11:25 14m 38s
[NFC] Typo fix in SP layer.
nv-lightning-v100 #14878: Pull request #7152 synchronize by tjruwase
March 21, 2025 11:24 4m 30s c8ef:master
March 21, 2025 11:24 4m 30s
nv-lightning-v100
nv-lightning-v100 #14877: Merge group checks requested
March 21, 2025 06:33 13m 40s
March 21, 2025 06:33 13m 40s
Variable batch size and LR scheduler
nv-lightning-v100 #14876: Pull request #7104 synchronize by tjruwase
March 21, 2025 06:05 46m 9s bm-synth:variable_batch_size_and_lr_2
March 21, 2025 06:05 46m 9s
Improve overflow handling in ZeRO
nv-lightning-v100 #14874: Pull request #6976 synchronize by tjruwase
March 21, 2025 06:04 4m 24s olruwase/ds_5241
March 21, 2025 06:04 4m 24s
Enable ZeRO set/get APIs for NVMe offload
nv-lightning-v100 #14873: Pull request #7046 synchronize by tjruwase
March 21, 2025 05:58 4m 26s olruwase/update_nvme_offload_states
March 21, 2025 05:58 4m 26s
nv-lightning-v100
nv-lightning-v100 #14872: Merge group checks requested
March 21, 2025 04:23 6m 44s
March 21, 2025 04:23 6m 44s
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-lightning-v100 #14871: Pull request #7163 synchronize by xiongjyu
March 21, 2025 03:59 4m 26s xiongjyu:master
March 21, 2025 03:59 4m 26s
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-lightning-v100 #14870: Pull request #7163 reopened by xiongjyu
March 21, 2025 03:58 Action required xiongjyu:master
March 21, 2025 03:58 Action required
nv-lightning-v100
nv-lightning-v100 #14867: Merge group checks requested
March 21, 2025 03:10 4m 41s
March 21, 2025 03:10 4m 41s
Update sharded_moe.py
nv-lightning-v100 #14866: Pull request #7138 synchronize by xiongjyu
March 21, 2025 02:53 Action required xiongjyu:master
March 21, 2025 02:53 Action required
async tp allreduce
nv-lightning-v100 #14865: Pull request #7115 synchronize by hwchen2017
March 21, 2025 00:26 Action required inkcherry:async_tp
March 21, 2025 00:26 Action required
nv-lightning-v100
nv-lightning-v100 #14864: Scheduled
March 21, 2025 00:22 41m 19s master
March 21, 2025 00:22 41m 19s
nv-lightning-v100
nv-lightning-v100 #14863: Merge group checks requested
March 20, 2025 23:48 11m 2s
March 20, 2025 23:48 11m 2s
PyDantic updates in preparation for V3
nv-lightning-v100 #14862: Pull request #7161 opened by loadams
March 20, 2025 23:26 8m 23s loadams/pydantic-update
March 20, 2025 23:26 8m 23s