[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #5742

This job was skipped