Skip to content

Actions: pytorch/torchft

Unit Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
357 workflow runs
357 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[WIP][RFC] Required changes for integration with TorchTitan
Unit Tests #232: Pull request #82 synchronize by fegin
January 28, 2025 18:27 10m 37s chienchin/torchtitan
January 28, 2025 18:27 10m 37s
Add DiLoCo
Unit Tests #231: Pull request #76 synchronize by H-Huang
January 28, 2025 15:57 8m 11s gh/H-Huang/1/head
January 28, 2025 15:57 8m 11s
checkpointing: use CheckpointTransport abstraction (#81)
Unit Tests #230: Commit ccf74d4 pushed by d4l3k
January 28, 2025 01:25 9m 37s main
January 28, 2025 01:25 9m 37s
checkpointing: use CheckpointTransport abstraction
Unit Tests #229: Pull request #81 synchronize by d4l3k
January 28, 2025 00:18 9m 40s d4l3k/checkpoint_transport
January 28, 2025 00:18 9m 40s
[WIP][RFC] Required changes for integration with TorchTitan
Unit Tests #228: Pull request #82 opened by fegin
January 27, 2025 21:30 8m 19s chienchin/torchtitan
January 27, 2025 21:30 8m 19s
process_group: wait for futher_thread join before creating new one (#68)
Unit Tests #227: Commit e177f9c pushed by d4l3k
January 27, 2025 19:24 9m 44s main
January 27, 2025 19:24 9m 44s
checkpointing: use CheckpointTransport abstraction
Unit Tests #226: Pull request #81 synchronize by d4l3k
January 25, 2025 00:37 9m 4s d4l3k/checkpoint_transport
January 25, 2025 00:37 9m 4s
checkpointing: use CheckpointTransport abstraction
Unit Tests #225: Pull request #81 synchronize by d4l3k
January 25, 2025 00:30 9m 13s d4l3k/checkpoint_transport
January 25, 2025 00:30 9m 13s
checkpointing: use CheckpointTransport abstraction
Unit Tests #224: Pull request #81 opened by d4l3k
January 25, 2025 00:05 17m 48s d4l3k/checkpoint_transport
January 25, 2025 00:05 17m 48s
process_group: wait for futher_thread join before creating new one
Unit Tests #223: Pull request #68 synchronize by dwancn
January 24, 2025 06:36 10m 19s dwancn:fix_pg_config
January 24, 2025 06:36 10m 19s
rust: add open telemetry tracing
Unit Tests #222: Pull request #80 synchronize by d4l3k
January 24, 2025 01:57 10m 22s d4l3k/otel
January 24, 2025 01:57 10m 22s
rust: add open telemetry tracing
Unit Tests #221: Pull request #80 opened by d4l3k
January 24, 2025 01:44 10m 39s d4l3k/otel
January 24, 2025 01:44 10m 39s
lighthouse/quorum: make it clear that quorum logs are for next quorum…
Unit Tests #220: Commit beb94f0 pushed by d4l3k
January 23, 2025 19:43 9m 59s main
January 23, 2025 19:43 9m 59s
lighthouse/quorum: make it clear that quorum logs are for next quorum
Unit Tests #219: Pull request #79 opened by d4l3k
January 23, 2025 19:21 13m 53s d4l3k/quorum_logs
January 23, 2025 19:21 13m 53s
process_group: wait for futher_thread join before creating new one
Unit Tests #218: Pull request #68 synchronize by dwancn
January 23, 2025 12:30 9m 54s dwancn:fix_pg_config
January 23, 2025 12:30 9m 54s
[WIP] FSDP example
Unit Tests #217: Pull request #77 synchronize by mreso
January 23, 2025 00:37 8m 23s mreso:fsdp_example
January 23, 2025 00:37 8m 23s
lib: fix Already borrowed (#78)
Unit Tests #216: Commit bed29d2 pushed by d4l3k
January 23, 2025 00:31 14m 21s main
January 23, 2025 00:31 14m 21s
lib: fix Already borrowed
Unit Tests #215: Pull request #78 opened by d4l3k
January 23, 2025 00:13 10m 47s d4l3k/already_borrowed
January 23, 2025 00:13 10m 47s
[WIP] FSDP example
Unit Tests #214: Pull request #77 opened by mreso
January 22, 2025 22:35 17m 39s mreso:fsdp_example
January 22, 2025 22:35 17m 39s
Add DiLoCo
Unit Tests #213: Pull request #76 synchronize by H-Huang
January 21, 2025 22:26 9m 14s gh/H-Huang/1/head
January 21, 2025 22:26 9m 14s
Add DiLoCo
Unit Tests #212: Pull request #76 opened by H-Huang
January 21, 2025 22:17 6m 46s gh/H-Huang/1/head
January 21, 2025 22:17 6m 46s
use torchx for manual many replica (20+) tests (#75)
Unit Tests #211: Commit 39a40b2 pushed by d4l3k
January 18, 2025 05:26 10m 13s main
January 18, 2025 05:26 10m 13s
use torchx for manual many replica (20+) tests
Unit Tests #210: Pull request #75 synchronize by d4l3k
January 18, 2025 00:40 10m 17s d4l3k/torchx
January 18, 2025 00:40 10m 17s
process_group: wait for futher_thread join before creating new one
Unit Tests #209: Pull request #68 synchronize by dwancn
January 17, 2025 03:18 9m 54s dwancn:fix_pg_config
January 17, 2025 03:18 9m 54s
use torchx for manual many replica (20+) tests
Unit Tests #208: Pull request #75 opened by d4l3k
January 16, 2025 22:51 9m 59s d4l3k/torchx
January 16, 2025 22:51 9m 59s