Skip to content

Agent MoE Experiment: Fixed bias increment (replace QB load balancing) #5232

@ClassicLarry

Description

@ClassicLarry

TL;DR

Replace QB load balancing with a simpler fixed bias increment. Each step, increase bias for underloaded experts and decrease for overloaded ones by a fixed delta.

User prompt

follow agent.md and implement fixed bias increment instead of QB load balancing. That is, on each step increase or decrease each router bias term by 0.01, depending on if the expert is underloaded or overloaded compared to its fair share. Test fixed increment scales of 0.01, 0.05, 0.005, 0.001.

Scope

  • Parent: MoE Scaling up to April goal #4281
  • Branch: moe_fixed_bias_increment
  • Sweep file: experiments/grug/moe/fixed_bias_increment_sweep.py
  • Config: GrugTrainerConfig.fixed_bias_increment

Mechanism

fair_share = total_tokens * K / E
bias[e] += delta * sign(fair_share - count[e])

QB computes an optimal threshold from top-k statistics. This tests whether a simpler sign-based update achieves similar balancing.

Gate 1 runs (8 total)

Delta d512 d768
0.001 pending pending
0.005 pending pending
0.01 pending pending
0.05 pending pending

Decision log

empty

Conclusion

pending

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions