Skip to content

Add MoeAdamHHeuristic, drop dense layers, fix align_kv_heads sharding #30

Add MoeAdamHHeuristic, drop dense layers, fix align_kv_heads sharding

Add MoeAdamHHeuristic, drop dense layers, fix align_kv_heads sharding #30

Annotations

1 warning

build

succeeded Apr 10, 2026 in 41s