Add MoeAdamHHeuristic, drop dense layers, fix align_kv_heads sharding #4636

Open
ClassicLarry wants to merge 8 commits into main from grug_moe_heuristic

Commits

Commits on Apr 10, 2026

Commits on Apr 12, 2026

Commits on Apr 13, 2026

Commits on Apr 15, 2026