Skip to content

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #1102

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #1102

Job log options

This job was skipped