Skip to content

Actions: WATonomous/troubleshoot-heterogenous-distributed-operations

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
27 workflow runs
27 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Use pytorch 2.5.1
Create and publish Docker images #27: Pull request #4 synchronize by ben-z
2h 13m 36s benz/torch-251
Use pytorch 2.5.1
Create and publish Docker images #26: Pull request #4 synchronize by ben-z
2h 35m 33s benz/torch-251
Use pytorch 2.5.1
Create and publish Docker images #25: Pull request #4 opened by ben-z
1h 52m 56s benz/torch-251
Install hipblas before rocblas
Create and publish Docker images #24: Commit e1b5908 pushed by ben-z
1h 34m 30s main
Upgrade ROCm to 6.3.3
Create and publish Docker images #23: Commit 7705311 pushed by ben-z
1h 22m 7s main
Build PyTorch from source
Create and publish Docker images #22: Commit aa35a3c pushed by ben-z
2h 10m 22s main
Add WIP pytorch experiments
Create and publish Docker images #21: Commit 5a82d52 pushed by ben-z
2h 37m 32s main
Add instructions for using custom UCC
Create and publish Docker images #20: Commit 058c73f pushed by ben-z
2h 25m 23s main
Add check for invalid rank
Create and publish Docker images #19: Commit 7e80ce8 pushed by ben-z
2h 18m 51s main
Remove ucx transport and interface pinning, and remove debug logs
Create and publish Docker images #18: Commit 3c53749 pushed by ben-z
2h 19m 37s main
Add support for running bidirectional send/recv test on rank0 rocm an…
Create and publish Docker images #17: Commit 64f3819 pushed by ben-z
2h 34m 0s main
Add simple send_recv test
Create and publish Docker images #16: Commit 9b183a6 pushed by ben-z
2h 24m 17s main
Add instructions for running on 2 nvidia GPUs
Create and publish Docker images #15: Commit 3606e90 pushed by ben-z
2h 27m 1s main
Add iproute2 to Dockerfile
Create and publish Docker images #14: Commit 8da1095 pushed by ben-z
2h 17m 51s main
Add instructions for bidirectional_send_recv
Create and publish Docker images #13: Commit d211bba pushed by ben-z
2h 31m 30s main
Use WATO runners to run CI (#3)
Create and publish Docker images #12: Commit 48c5c45 pushed by ben-z
4h 41m 7s main
Use WATO runners to run CI
Create and publish Docker images #11: Pull request #3 synchronize by ben-z
2h 13m 13s ben-z-patch-1
Use WATO runners to run CI
Create and publish Docker images #10: Pull request #3 synchronize by alexboden
3h 18m 34s ben-z-patch-1
Add instructions for setting up containers and running tests
Create and publish Docker images #9: Commit 4a94d88 pushed by ben-z
18m 39s main
Add instructions for setting up containers and running tests
Create and publish Docker images #8: Commit 316d259 pushed by ben-z
18m 25s main
Use WATO runners to run CI
Create and publish Docker images #7: Pull request #3 opened by ben-z
4h 8m 26s ben-z-patch-1
Build rocm image
Create and publish Docker images #6: Commit 1dd8dbc pushed by ben-z
17m 41s main
Rename workflow
Create and publish Docker images #5: Commit 8a5fb11 pushed by ben-z
33m 35s main
Start SSH servers
Create and publish Docker images #4: Commit 9d725da pushed by ben-z
18m 49s main
Install boost separately
Create and publish Docker images #3: Commit d542321 pushed by ben-z
17m 46s main