Add MoeAdamHHeuristic, drop dense layers, fix align_kv_heads sharding #30

resolve

succeeded Apr 10, 2026 in 3s