Skip to content

Self-hosted runner scale set (AMD mi325 scheduled CI caller) #15

Self-hosted runner scale set (AMD mi325 scheduled CI caller)

Self-hosted runner scale set (AMD mi325 scheduled CI caller) #15

Triggered via workflow run October 27, 2025 03:35
@Flink-dddFlink-ddd
completed 77e8b9f
Status Failure
Total duration 19s
Artifacts
Matrix: DeepSpeed CI / Check Runners
Matrix: Example CI / Check Runners
Matrix: Model CI / Check Runners
Matrix: Torch pipeline CI / Check Runners
Matrix: DeepSpeed CI / Setup
Matrix: DeepSpeed CI / Examples directory
Matrix: DeepSpeed CI / PyTorch pipelines
Matrix: DeepSpeed CI / Torch ROCm deepspeed tests
Matrix: Example CI / Setup
Matrix: Example CI / Examples directory
Matrix: Example CI / PyTorch pipelines
Matrix: Example CI / Torch ROCm deepspeed tests
Matrix: Model CI / Setup
Matrix: Model CI / Examples directory
Matrix: Model CI / PyTorch pipelines
Matrix: Model CI / Torch ROCm deepspeed tests
Matrix: Torch pipeline CI / Setup
Matrix: Torch pipeline CI / Examples directory
Matrix: Torch pipeline CI / PyTorch pipelines
Matrix: Torch pipeline CI / Torch ROCm deepspeed tests
Matrix: DeepSpeed CI / Single GPU tests
Waiting for pending jobs
Matrix: Example CI / Single GPU tests
Waiting for pending jobs
Matrix: Model CI / Single GPU tests
Waiting for pending jobs
Matrix: Torch pipeline CI / Single GPU tests
Waiting for pending jobs
DeepSpeed CI  /  ...  /  Send results to webhook
15s
DeepSpeed CI / Slack Report / Send results to webhook
Example CI  /  ...  /  Send results to webhook
11s
Example CI / Slack Report / Send results to webhook
Model CI  /  ...  /  Send results to webhook
13s
Model CI / Slack Report / Send results to webhook
Torch pipeline CI  /  ...  /  Send results to webhook
14s
Torch pipeline CI / Slack Report / Send results to webhook
Fit to window
Zoom out
Zoom in

Annotations

12 errors
Model CI / Check Runners (2gpu)
The strategy configuration was canceled because "model-ci.check_runners._1gpu" failed
Torch pipeline CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
Model CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
Example CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
DeepSpeed CI / Check Runners (2gpu)
The strategy configuration was canceled because "deepspeed-ci.check_runners._1gpu" failed
Torch pipeline CI / Check Runners (2gpu)
The strategy configuration was canceled because "torch-pipeline.check_runners._1gpu" failed
DeepSpeed CI / Check Runners (1gpu)
Required runner group 'amd-mi325-1gpu' not found
Example CI / Check Runners (2gpu)
Required runner group 'amd-mi325-2gpu' not found
Model CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Torch pipeline CI / Slack Report / Send results to webhook
Process completed with exit code 1.
DeepSpeed CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Example CI / Slack Report / Send results to webhook
Process completed with exit code 1.