Description
TL;DR: Sweep E over {128,256,512} within the good-enough 10T gate instead of assuming the current expert count is settled.
Hypothesis or Goal
We want to know whether expert count is one of the main remaining levers for the baseline recipe.
Links
Results
Description
TL;DR: Sweep E over {128,256,512} within the good-enough 10T gate instead of assuming the current expert count is settled.
Hypothesis or Goal
We want to know whether expert count is one of the main remaining levers for the baseline recipe.
Links
Results