Scaling recipes is a project for studying best practices for scaling neural networks across different tasks.
- Classification on MNIST:
  - Implement Standard Parametrization (SP).
  - Implement Maximal Update Parametrization (muP).
  - Evaluate the two parametrizations by varying the learning rate, width, etc. (see the parametrization sketch after this list).
- Flow matching on a toy dataset:
  - Implement Standard Parametrization (SP).
  - Implement Maximal Update Parametrization (muP).
  - Evaluate the two parametrizations by varying the learning rate, width, etc. (a toy flow-matching sketch also follows the list).
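For concreteness, here is a minimal sketch (not the repo's code) of how muP differs from SP for a simple MLP, using one common simplification of the Adam variant of muP: weights whose fan-in grows with width get a learning rate scaled by `base_width / width`, and the readout is zero-initialized, while under SP every parameter simply uses the base learning rate. The names `make_mlp`, `make_optimizer`, and `base_width` are assumptions for this example.

```python
import torch
import torch.nn as nn


def make_mlp(width: int, d_in: int = 784, d_out: int = 10) -> nn.Sequential:
    # Two hidden layers of size `width`; d_in/d_out are fixed by the task.
    return nn.Sequential(
        nn.Linear(d_in, width), nn.ReLU(),
        nn.Linear(width, width), nn.ReLU(),
        nn.Linear(width, d_out),
    )


def make_optimizer(model: nn.Sequential, base_lr: float, width: int,
                   base_width: int, mup: bool) -> torch.optim.Adam:
    if not mup:  # SP: one global learning rate, default init
        return torch.optim.Adam(model.parameters(), lr=base_lr)
    m = width / base_width            # width multiplier
    nn.init.zeros_(model[-1].weight)  # muP: zero-init the readout layer
    groups = []
    for p in model.parameters():
        # Hidden and readout weights have fan-in == width, so their Adam
        # learning rate shrinks as 1/width; everything else (input
        # weights, biases) keeps the base learning rate.
        scaled = p.ndim == 2 and p.shape[1] == width
        groups.append({"params": [p], "lr": base_lr / m if scaled else base_lr})
    return torch.optim.Adam(groups)
```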
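The flow-matching objective itself fits in a few lines. Below is an illustrative conditional flow matching training loop on a 2D toy dataset; the repo's actual dataset, model, and hyperparameters may differ.

```python
import torch
import torch.nn as nn

# Velocity field v(x, t): input is (x, t) concatenated, output is a 2D velocity.
net = nn.Sequential(nn.Linear(3, 128), nn.SiLU(), nn.Linear(128, 2))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for step in range(1000):
    x1 = 0.1 * torch.randn(256, 2) + torch.tensor([1.0, 1.0])  # toy data blob
    x0 = torch.randn(256, 2)                                    # base noise
    t = torch.rand(256, 1)
    xt = (1 - t) * x0 + t * x1    # point on the straight-line path
    target = x1 - x0              # its (constant) velocity
    loss = ((net(torch.cat([xt, t], dim=1)) - target) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()
```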
```bash
python -m venv venv
source venv/bin/activate
pip install .            # standard install
# or, for development:
pip install -e .         # editable install
pip install -e ".[dev]"  # editable install with dev extras
```
The config file can be found at `slfm/cli/conf/base.yaml`.
- Sample command to train and evaluate the model:

  ```bash
  width=120
  lr=0.01
  train_and_evaluate "++model.width=${width}" "++trainer.optimizer.lr=${lr}"
  ```
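The `++key=value` syntax above is Hydra's command-line override notation. Purely as a sketch, and assuming the CLI is Hydra-based (which the config layout suggests; the actual `train_and_evaluate` implementation lives in the repo), an entry point consuming these overrides might look like:

```python
import hydra
from omegaconf import DictConfig


@hydra.main(config_path="conf", config_name="base", version_base=None)
def train_and_evaluate(cfg: DictConfig) -> None:
    width = cfg.model.width          # set by ++model.width=...
    lr = cfg.trainer.optimizer.lr    # set by ++trainer.optimizer.lr=...
    ...  # build the model and trainer, then run


if __name__ == "__main__":
    train_and_evaluate()
```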
- Sample command to run a sweep over different learning rates and widths with different parametrizations:

  ```bash
  sweep
  ```
Key observation:
- muP shows more consistent convergence behavior across widths, which enables better hyperparameter (HP) transfer; the sketch below shows the kind of sweep that surfaces this.
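A self-contained, illustrative version of such a sweep (a toy regression task with a simplified muP scaling rule, not the repo's `sweep` command): under muP the best base learning rate should stay roughly width-independent, while under SP it tends to drift as width grows. All names here are hypothetical.

```python
import itertools
import torch
import torch.nn as nn


def run(width: int, lr: float, mup: bool, base_width: int = 64,
        steps: int = 200) -> float:
    torch.manual_seed(0)  # same data and init draw for every configuration
    net = nn.Sequential(nn.Linear(16, width), nn.ReLU(), nn.Linear(width, 1))
    if mup:
        nn.init.zeros_(net[-1].weight)  # muP readout init
    m = width / base_width
    # muP: scale the Adam lr by 1/m for weights whose fan-in grows with width.
    groups = [{"params": [p],
               "lr": lr / m if mup and p.ndim == 2 and p.shape[1] == width else lr}
              for p in net.parameters()]
    opt = torch.optim.Adam(groups)
    x = torch.randn(512, 16)
    y = x.sum(dim=1, keepdim=True)  # fixed toy regression target
    for _ in range(steps):
        loss = ((net(x) - y) ** 2).mean()
        opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()


for mup in (False, True):
    for width, lr in itertools.product([64, 256, 1024], [1e-3, 1e-2, 1e-1]):
        print(f"mup={mup} width={width} lr={lr:.0e} loss={run(width, lr, mup):.4f}")
```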
TBD.
The project started from a very cool notebook on flow matching; a lot of the scaling code is borrowed from that guide.