NVIDIA BioNeMo Framework v2.5
New Features
- Added the Evo2 model training workflow, covering data preprocessing, pre-training, fine-tuning, and inference, with bf16 and fp8 support.
Updates & Improvements
- Added support for and upgraded the BioNeMo federated learning examples in NVFlare
- Upgraded bionemo-moco to v0.0.2
- Brev.dev launchable tutorials
What's Changed
- Bump 3rdparty/Megatron-LM from 2a9793d to a0365bc by @dependabot in #692
- Bump 3rdparty/NeMo from 48f10af to ee28bc5 by @dependabot in #693
- add announcement README.md by @ntadimeti in #695
- Adjust ESM2 fine-tuning to allow NVFlare usecases by @farhadrgh in #689
- Upgrade bionemo-moco to v0.0.2 by @nvdreidenbach in #688
- disable metric when model parallel by @sichu2023 in #701
- bump NeMo by @farhadrgh in #703
- split trufflehog scan into two actions, run on entire repo on scheduled event by @pstjohn in #696
- cve vulnerability on main by @dorotat-nv in #709
- move trufflehog scan to new action by @pstjohn in #721
- Pstjohn/trufflehog move action 2 by @pstjohn in #722
- Evo2 by @jstjohn in #694
- Trigger and skip trufflehog scan in merge group by @pstjohn in #728
- remove zstandard to address nvbug 5149698 by @pstjohn in #726
- JET for evo2: 1b model training by @dorotat-nv in #727
- If desired, training can be stopped on a specific step without impacting the LR curve. by @jstjohn in #739
- Cleanup any new files made by notebook tests by @jstjohn in #748
- Adding bf16 fine-tuned variant of evo2 1b checkpoint by @jstjohn in #747
- Bump nemo version to have the 1b checkpoint fix by @jstjohn in #729
- GTC Evo2 Demo Notebooks by @jwilber in #724
- [cye/subpack-ci] Add sub-package build, test, and publish to OSS. (WORK IN PROGRESS - PENDING MORE SUB-PACKAGE COVERAGE) by @cspades in #725
- Disable notebook and slow tests from running in merge queue by @pstjohn in #754
- fix: removes BIONEMO_HOME from repository [JIRA-BIONEMO-482] by @jomitchellnv in #742
- Update brev.dev badges to launchable built off main branch by @jwilber in #752
- Evo2 modelcard by @jstjohn in #746
- [cye/fix-subpack-ci] Fix bug where workflow dispatch collected packages are not passed to the next job. by @cspades in #753
- reduced mem to 12gb by @nvdreidenbach in #730
- Initial commits prepping for nv-gha-runners by @pstjohn in #733
- xfail evo2 long context train test by @dorotat-nv in #732
New Contributors
- @ntadimeti made their first contribution in #695
Full Changelog: v2.4.1...v2.5