Skip to content

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #78

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation

[moe] Add num_dense_layers to Grug MoE for first-k-dense ablation #78

Job log options

This job was skipped