fal-ai-community/nano-mdm

Nano MDM

Decoding Process Visualization
  • trainer with DDP

  • generation (perplexity / diversity tradeoff with reference model?)

  • Sweep learning rate

  • Sweep width and see muP happening

About

Tiny re-implementation of MDM (masked diffusion model) in the style of LLaDA and the nano-gpt speedrun
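
For readers new to masked diffusion, here is a minimal sketch of the forward (masking) step and the per-token loss weighting as described for LLaDA-style training: sample a mask ratio `t`, mask each token independently with probability `t`, and weight the cross-entropy on masked positions by `1/t`. This is not taken from this repository's code; `MASK_ID`, `mask_tokens`, and `masked_loss_weights` are hypothetical names chosen for illustration, and the model forward pass is omitted.

```python
import random

# Hypothetical mask token id; a real tokenizer would reserve its own id.
MASK_ID = -1


def mask_tokens(tokens, t, rng):
    """Replace each token with MASK_ID independently with probability t."""
    return [MASK_ID if rng.random() < t else tok for tok in tokens]


def masked_loss_weights(noisy, t):
    """Per-position loss weights: 1/t on masked positions, 0 elsewhere.

    In LLaDA-style training these weights multiply the per-token
    cross-entropy, so only masked positions contribute to the loss.
    """
    return [1.0 / t if tok == MASK_ID else 0.0 for tok in noisy]


rng = random.Random(0)
tokens = [5, 17, 42, 8, 23, 4]

# Sample a mask ratio in (0, 1); clamp away from 0 to keep 1/t finite.
t = max(rng.random(), 1e-3)
noisy = mask_tokens(tokens, t, rng)
weights = masked_loss_weights(noisy, t)

print("t       =", round(t, 3))
print("noisy   =", noisy)
print("weights =", [round(w, 3) for w in weights])
```

At `t = 1.0` every token is masked and at `t = 0.0` none are, which brackets the range the sampler has to reverse at generation time.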