Some example images generated by the model
reimei: daybreak; dawn; dawn (of a new age)
Training and inference code for reimei, a diffusion model for image generation, optimized to be highly efficient to train. Pretraining cost roughly $6,000.
Model features
- MoE-MMDiT blocks (MMDiT as in SD3/Flux, with mixture-of-experts routing as in DiT-MoE/EC-DiT)
- DC-AE autoencoder (f64c128), i.e. the model is trained on a highly compressed latent space with 64x spatial downsampling and 128 latent channels
- SigLIP text encoder
- Deferred masking of text tokens to reduce the sequence length in the transformer at train time: at 256x256 resolution the latent is 4x4 = 16 tokens and the 64 text tokens are masked down to 16, for a total of 32 tokens per sample. During 1024x1024 finetuning, masking is removed. (See the sketch after this list.)
- Shared parameters across layers: the AdaLN modulation weights and the QKVO attention projections are shared between blocks (DiT-Air)
- Layerwise scaling of the MLP widths across transformer blocks
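To make the token budget concrete, here is a minimal sketch of the deferred text-token masking used during 256x256 pretraining. The random selection criterion, the model width of 1152, and the function name are illustrative assumptions and are not taken from this repo.

```python
import torch

def mask_text_tokens(text_tokens: torch.Tensor, keep: int = 16) -> torch.Tensor:
    """Keep `keep` text tokens per sample (random choice here for illustration;
    the repo's actual selection criterion may differ)."""
    b, n, d = text_tokens.shape
    idx = torch.rand(b, n, device=text_tokens.device).argsort(dim=1)[:, :keep]
    return text_tokens.gather(1, idx.unsqueeze(-1).expand(-1, -1, d))

# Token budget at 256x256 pretraining with the DC-AE f64c128 latent:
#   image: (256 / 64) * (256 / 64) = 4 * 4 = 16 latent tokens
#   text:  64 SigLIP tokens masked down to 16
#   total: 32 tokens per sample entering the transformer
latents = torch.randn(8, 16, 1152)  # hypothetical model width of 1152
text = torch.randn(8, 64, 1152)
seq = torch.cat([latents, mask_text_tokens(text, keep=16)], dim=1)
print(seq.shape)  # torch.Size([8, 32, 1152])

# At 1024x1024 finetuning masking is removed:
#   image: (1024 / 64) ** 2 = 256 latent tokens, text: all 64 tokens -> 320 total
```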
Find pretrained weights here
To run training, first install the requirements:

```bash
pip install -r requirements.txt
```

then set up accelerate using

```bash
accelerate config
```

and launch with

```bash
accelerate launch
```
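Note that `accelerate launch` takes the path to the training script as its argument; for example, assuming a hypothetical entrypoint named train.py:

```bash
accelerate launch train.py  # train.py is a placeholder; use the repo's actual training entrypoint
```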
