Releases: ARM-software/Tool-Solutions
Releases · ARM-software/Tool-Solutions
Tool Solutions: ML Frameworks r25.03
PyTorch build
https://github.com/ARM-software/Tool-Solutions/blob/r25.03/ML-Frameworks/pytorch-aarch64/README.md
Added
- Adds work-in-progress PyTorch PRs:
- pytorch/pytorch#148542 - Enables direct use Compute Library in ATen.
- pytorch/pytorch#147337 - Enables a fast path for static qlinear via Compute Library directly.
- pytorch/pytorch#146620 - Enables qint8 and quint8 add via Compute Library directly. Speedup for OMP_NUM_THREADS=1 is ~15x, and ~5.4x for 32 threads.
- pytorch/pytorch#148197 - Enables oneDNN dispatch for GEMM bf16bf16->bf16.
- pytorch/pytorch#140159 - Enables gemm-bf16f32/
- Adds work-in-progress oneDNN PRs:
- uxlfoundation/oneDNN#2838 - Dispatches fpmath_mode::bf16 conv to Compute Library.
Changed
- Updates hashes for:
- PyTorch to e555c4d (2.7.0.dev20250305) from viable/strict branch.
- ideep to 719d8e6 from ideep_pytorch branch.
- oneDNN to 321c452 from main branch.
- Compute Library to v25.02.1.
- OpenBLAS to ef9e3f7 from main.
- Updates work-in-progress PyTorch PRs.
- Updates torchaudio to 2.6.0.dev20250305.
- Updates torchvision to 0.22.0.dev20250305.
- Dockerfile now upgrades pip before installing Python packages.
- git-shallow-clone function now supports cloning by tag as well as hash.
Removed
- Removes patches which have now been merged into the upstream branches.
- Removes static_quantize_conv example since pytorch/pytorch#141127 is no longer included in the build.
TensorFlow build
https://github.com/ARM-software/Tool-Solutions/blob/r25.03/ML-Frameworks/tensorflow-aarch64/README.md
This release establishes a new TensorFlow build based on the approach taken with PyTorch builds.
Changed
- Updated TensorFlow to 2.20.dev-6506da6
Removed
- Removed MLCommons examples and patches
- Removed cpp examples
- Removed all inline patches, work in progress features should now be applied
from PRs using wget in./get-source.sh.
Complete list of PRs:
- TensorFlow refactor: Build wheel + install in docker by @davsva01 in #293
- Update openblas hash by @aditew01 in #294
- Add test script for TensorFlow by @davsva01 in #295
- Remove bumping of oneDNN and ACL by @davsva01 in #296
- Updates PyTorch build. by @nSircombe in #297
- Fixes PyTorch build. by @nSircombe in #298
- Fixes mistake in OpenBLAS build by @nSircombe in #299
- Fixes OpenBLAS PR5157 hash. by @nSircombe in #300
- Updates for r25.03 by @nSircombe in #302
- Set weights_only=False in torch.load by @nSircombe in #303
- Fix broken links and incorrect docs by @davsva01 in #304
- Removes static_quatize_conv example and adds fpmath_mode::bf16 Compute Library dispatch patch by @nSircombe in #305
- Adds --force to git fetch command by @nSircombe in #306
Full Diff: pytorch-aarch64--r25.02...r25.03