Skip to content

[grug] Multi-host MoE training fails with ShardingTypeError on shared-expert residual add #4309

[grug] Multi-host MoE training fails with ShardingTypeError on shared-expert residual add

[grug] Multi-host MoE training fails with ShardingTypeError on shared-expert residual add #4309

Triggered via issue June 9, 2026 04:28
Status Skipped
Total duration 2s
Artifacts

ops-claude.yaml

on: issue_comment
Fit to window
Zoom out
Zoom in