Skip to content

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #175

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #175

Annotations

1 warning

grug-variant-diff

succeeded Mar 23, 2026 in 20s