Skip to content

v0.18.0

Compare
Choose a tag to compare
@rithwik-db rithwik-db released this 18 Mar 18:31

What's Changed

  • Torch has been bumped to 2.6.0 (in #1740)
    • Sparse support has been disabled in the latest megablocks version (as part of the latest torch upgrade) and we cascaded those disables to llm-foundry as well (for more details, view the megablocks release)
  • TransformerEngine has been removed from the all dependency group due to version compatibility issues (in #1742). We expect to add this back in a future release.
  • Transformers has been bumped to v4.49.0 (in #1735) and this would result in the master weights being torch.bfloat16 (view huggingface/transformers#36567 for more context). llm-foundry doesn't support master weights in lower precision, so we manually hardcoded this to torch.float32 when loading in #1734.

Detailed Changes

New Contributors

Full Changelog: v0.17.1...v0.18.0