[moe] Make capacity_factor configurable and add sweep script #4056

Closed
claude[bot] wants to merge 1 commit into main from agent/20260323-fix-4017

claude[bot] commented Mar 23, 2026

Add capacity_factor as a field on GrugModelConfig (default 1.25, preserving the existing hardcoded constant) and pass it through MoEMLP to moe_mlp() instead of using the module-level constant. Add sweep_capacity_factor.py that runs the trial model at capacity factors {1.0, 1.125, 1.25, 1.5, 2.0} to determine whether the default masks avoidable overflow or throughput loss.
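The plumbing described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `SketchModelConfig`, `expert_capacity`, `sweep_configs`, and the fields `num_experts` and `num_experts_per_token` are hypothetical stand-ins, and the capacity formula is the common `ceil(capacity_factor * tokens * top_k / num_experts)` convention, which may differ from what `moe_mlp()` actually computes. Only the default of 1.25 and the sweep values come from the description.

```python
import dataclasses
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class SketchModelConfig:
    """Hypothetical stand-in for GrugModelConfig (not the real class)."""

    num_experts: int = 8            # assumed value, for illustration only
    num_experts_per_token: int = 2  # assumed top-k, for illustration only
    capacity_factor: float = 1.25   # default stated in the PR description


def expert_capacity(num_tokens: int, cfg: SketchModelConfig) -> int:
    """Token slots per expert under an assumed, commonly used convention.

    With a perfectly balanced router each expert receives
    num_tokens * top_k / num_experts tokens; capacity_factor scales that
    budget to leave headroom for imbalance.
    """
    balanced_load = num_tokens * cfg.num_experts_per_token / cfg.num_experts
    return math.ceil(cfg.capacity_factor * balanced_load)


# Sweep values listed in the PR description.
SWEEP_CAPACITY_FACTORS = (1.0, 1.125, 1.25, 1.5, 2.0)


def sweep_configs(base: SketchModelConfig) -> list[SketchModelConfig]:
    """Return one config per swept capacity factor, leaving other fields unchanged."""
    return [dataclasses.replace(base, capacity_factor=cf) for cf in SWEEP_CAPACITY_FACTORS]


if __name__ == "__main__":
    for cfg in sweep_configs(SketchModelConfig()):
        print(cfg.capacity_factor, expert_capacity(4096, cfg))
```

Under these assumptions, a factor of 1.0 leaves no headroom beyond a perfectly balanced routing, so any imbalance overflows, while 2.0 doubles the per-expert buffer at the cost of extra padded compute. That is the overflow versus throughput trade-off the sweep is meant to probe.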

Fixes #4017

…eep script

Add capacity_factor field to GrugModelConfig (default 1.25, matching the
existing hardcoded value) so it can be varied in experiment sweeps. Add
sweep_capacity_factor.py to sweep over {1.0, 1.125, 1.25, 1.5, 2.0}.

Fixes #4017

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
claude[bot] added the agent-generated label ("Created by automation/agent") Mar 23, 2026
dlwh closed this Apr 1, 2026

Development

Successfully merging this pull request may close these issues.

[moe] Good 10T: sweep capacity factor
