Actions: deepspeedai/DeepSpeed

nv-torch-latest-v100

4,421 workflow runs

nv-torch-latest-v100
nv-torch-latest-v100 #13785: Scheduled
March 23, 2025 00:24 In progress master
nv-torch-latest-v100
nv-torch-latest-v100 #13784: Merge group checks requested
March 22, 2025 19:19 1h 12m 15s
Avoid graph break by removing redundant requires_grad attr change
nv-torch-latest-v100 #13783: Pull request #7158 synchronize by hwchen2017
March 22, 2025 16:32 1h 9m 11s deepcharm:master
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-torch-latest-v100 #13782: Pull request #7163 synchronize by xiongjyu
March 22, 2025 06:52 Action required xiongjyu:master
Fix pre-compile on cpu-only machines
nv-torch-latest-v100 #13781: Pull request #7168 opened by AlongWY
March 22, 2025 06:23 Action required AlongWY:patch-1
nv-torch-latest-v100
nv-torch-latest-v100 #13780: Scheduled
March 22, 2025 00:21 6h 19m 54s master
DeepCompile for enhanced compiler integration
nv-torch-latest-v100 #13779: Pull request #7154 synchronize by tohtana
March 21, 2025 22:07 6h 0m 21s tohtana/deepcompile
Link AutoTP blog in the front page
nv-torch-latest-v100 #13778: Pull request #7167 opened by hwchen2017
March 21, 2025 22:01 1h 8m 7s hongwei_link_autotp_blog
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-torch-latest-v100 #13777: Pull request #7166 synchronize by mauryaavinash95
March 21, 2025 19:14 6h 0m 25s DataStates:dev
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
nv-torch-latest-v100 #13776: Pull request #7166 opened by mauryaavinash95
March 21, 2025 19:14 Action required DataStates:dev
Add destroy to tests to free memory
nv-torch-latest-v100 #13775: Pull request #7160 synchronize by tohtana
March 21, 2025 15:51 6h 0m 21s tohtana/destroy_model_test_zero
async tp allreduce
nv-torch-latest-v100 #13774: Pull request #7115 synchronize by tjruwase
March 21, 2025 11:25 1h 12m 57s inkcherry:async_tp
[NFC] Typo fix in SP layer.
nv-torch-latest-v100 #13773: Pull request #7152 synchronize by tjruwase
March 21, 2025 11:24 6h 0m 23s c8ef:master
nv-torch-latest-v100
nv-torch-latest-v100 #13772: Merge group checks requested
March 21, 2025 06:33 6h 24m 57s
Variable batch size and LR scheduler
nv-torch-latest-v100 #13771: Pull request #7104 synchronize by tjruwase
March 21, 2025 06:05 1h 2m 38s bm-synth:variable_batch_size_and_lr_2
Improve overflow handling in ZeRO
nv-torch-latest-v100 #13769: Pull request #6976 synchronize by tjruwase
March 21, 2025 06:04 6h 0m 51s olruwase/ds_5241
Enable ZeRO set/get APIs for NVMe offload
nv-torch-latest-v100 #13768: Pull request #7046 synchronize by tjruwase
March 21, 2025 05:58 1h 5m 7s olruwase/update_nvme_offload_states
nv-torch-latest-v100
nv-torch-latest-v100 #13767: Merge group checks requested
March 21, 2025 04:23 32s
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-torch-latest-v100 #13766: Pull request #7163 synchronize by xiongjyu
March 21, 2025 03:59 6h 0m 21s xiongjyu:master
fixed: Modified the topkgating function and modified the test_moe file for testing
nv-torch-latest-v100 #13765: Pull request #7163 reopened by xiongjyu
March 21, 2025 03:58 Action required xiongjyu:master
nv-torch-latest-v100
nv-torch-latest-v100 #13762: Merge group checks requested
March 21, 2025 03:10 1h 11m 14s
Update sharded_moe.py
nv-torch-latest-v100 #13761: Pull request #7138 synchronize by xiongjyu
March 21, 2025 02:53 Action required xiongjyu:master
async tp allreduce
nv-torch-latest-v100 #13760: Pull request #7115 synchronize by hwchen2017
March 21, 2025 00:26 Action required inkcherry:async_tp
nv-torch-latest-v100
nv-torch-latest-v100 #13759: Scheduled
March 21, 2025 00:22 6h 20m 21s master