Megatron Examples

Advanced recipes and configuration overrides for training models using the Megatron-Core backend.

Available Model Recipes

| Recipe | Key Scripts | Description |
| --- | --- | --- |
| DiT | Pretrain, Inference | Diffusion Transformer (DiT) training on butterfly dataset |
| Wan | Pretrain, Inference | Wan 2.1 model pre-training and inference |

Directory Structure

| Directory | Description |
| --- | --- |
| recipes | Source code and scripts for the models above |
| override_configs | Configuration overrides for customizing parallelism (TP/CP/SP) |
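
When choosing TP (tensor parallel) and CP (context parallel) sizes for an override, it can help to check that they divide the available GPU count. The sketch below is illustrative only and is not part of this repository: the function name `data_parallel_size` and its parameters are hypothetical. It assumes the usual Megatron-Core layout in which the data-parallel size is the world size divided by the product of the tensor, context, and pipeline parallel sizes, and in which SP (sequence parallel) reuses the tensor-parallel group and so adds no extra factor.

```python
def data_parallel_size(world_size: int, tp: int = 1, cp: int = 1, pp: int = 1) -> int:
    """Return the data-parallel size left after model parallelism.

    Hypothetical helper for sanity-checking an override; not a Megatron-Core API.
    SP is not a separate factor here because it shards activations across the
    existing tensor-parallel group.
    """
    if min(world_size, tp, cp, pp) < 1:
        raise ValueError("world_size, tp, cp, and pp must all be >= 1")

    model_parallel = tp * cp * pp
    if world_size % model_parallel != 0:
        raise ValueError(
            f"world_size={world_size} is not divisible by tp*cp*pp={model_parallel}"
        )
    return world_size // model_parallel


if __name__ == "__main__":
    # Example: 8 GPUs with TP=2 and CP=2 leave 2 data-parallel replicas.
    print(data_parallel_size(8, tp=2, cp=2))
```

A configuration that raises `ValueError` here would not map cleanly onto the available GPUs under the assumptions above, so it is worth adjusting the sizes before launching a job.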