[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #3960

This job was skipped