Agent MoE Experiment: Fixed bias increment (replace QB load balancing)

## TL;DR
Replace QB load balancing with a simpler fixed bias increment. Each step, increase bias for underloaded experts and decrease for overloaded ones by a fixed delta.

## User prompt
> follow agent.md and implement fixed bias increment instead of QB load balancing. That is, on each step increase or decrease each router bias term by 0.01, depending on if the expert is underloaded or overloaded compared to its fair share. Test fixed increment scales of 0.01, 0.05, 0.005, 0.001.

## Scope
- **Parent**: #4281
- **Branch**: `moe_fixed_bias_increment`
- **Sweep file**: `experiments/grug/moe/fixed_bias_increment_sweep.py`
- **Config**: `GrugTrainerConfig.fixed_bias_increment`

## Mechanism
```
fair_share = total_tokens * K / E
bias[e] += delta * sign(fair_share - count[e])
```
QB computes an optimal threshold from top-k statistics. This tests whether a simpler sign-based update achieves similar balancing.

## Gate 1 runs (8 total)
| Delta | d512 | d768 |
|-------|------|------|
| 0.001 | pending | pending |
| 0.005 | pending | pending |
| 0.01 | pending | pending |
| 0.05 | pending | pending |

## Decision log
_empty_

## Conclusion
_pending_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Agent MoE Experiment: Fixed bias increment (replace QB load balancing) #5232

TL;DR

User prompt

Scope

Mechanism

Gate 1 runs (8 total)

Decision log

Conclusion

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Delta	d512	d768
0.001	pending	pending
0.005	pending	pending
0.01	pending	pending
0.05	pending	pending

Agent MoE Experiment: Fixed bias increment (replace QB load balancing) #5232

Description

TL;DR

User prompt

Scope

Mechanism

Gate 1 runs (8 total)

Decision log

Conclusion

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions