
14 Mar 15:13

nSircombe

Tool Solutions: ML Frameworks r25.03

PyTorch build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03/ML-Frameworks/pytorch-aarch64/README.md

Added

Adds work-in-progress PyTorch PRs:
- pytorch/pytorch#148542 - Enables direct use of Compute Library in ATen.
- pytorch/pytorch#147337 - Enables a fast path for static qlinear via Compute Library directly.
- pytorch/pytorch#146620 - Enables qint8 and quint8 add via Compute Library directly. Speedup for OMP_NUM_THREADS=1 is ~15x, and ~5.4x for 32 threads.
- pytorch/pytorch#148197 - Enables oneDNN dispatch for GEMM bf16bf16->bf16.
- pytorch/pytorch#140159 - Enables gemm-bf16f32/
Adds work-in-progress oneDNN PRs:
- uxlfoundation/oneDNN#2838 - Dispatches fpmath_mode::bf16 conv to Compute Library.

Changed

Updates hashes for:
- PyTorch to e555c4d (2.7.0.dev20250305) from viable/strict branch.
- ideep to 719d8e6 from ideep_pytorch branch.
- oneDNN to 321c452 from main branch.
- Compute Library to v25.02.1.
- OpenBLAS to ef9e3f7 from main.
Updates work-in-progress PyTorch PRs.
Updates torchaudio to 2.6.0.dev20250305.
Updates torchvision to 0.22.0.dev20250305.
Dockerfile now upgrades pip before installing Python packages.
git-shallow-clone function now supports cloning by tag as well as hash.
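
To illustrate the tag-versus-hash distinction, here is a minimal sketch of the git invocations a shallow clone typically needs in each case. It is not the repository's actual git-shallow-clone implementation: the helper name `shallow_clone_commands`, its parameters, and the exact command sequence are assumptions made for illustration.

```python
from typing import List


def shallow_clone_commands(
    url: str, ref: str, dest: str, ref_is_tag: bool
) -> List[List[str]]:
    """Return the git invocations for a depth-1 checkout of `ref` into `dest`.

    Hypothetical helper: it only builds the commands and does not run them.
    """
    if ref_is_tag:
        # A tag name can be passed to `git clone --branch` at depth 1.
        return [["git", "clone", "--depth", "1", "--branch", ref, url, dest]]
    # A commit hash is not accepted by `git clone --branch`, so it is
    # fetched explicitly. In practice this usually needs the full hash,
    # and the server must allow fetching that commit.
    return [
        ["git", "init", dest],
        ["git", "-C", dest, "remote", "add", "origin", url],
        ["git", "-C", dest, "fetch", "--depth", "1", "origin", ref],
        ["git", "-C", dest, "checkout", "FETCH_HEAD"],
    ]
```

Each returned list could be passed to `subprocess.run(cmd, check=True)`. Keeping command construction separate from execution makes the two paths easy to inspect and test.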

Removed

Removes patches which have now been merged into the upstream branches.
Removes static_quantize_conv example since pytorch/pytorch#141127 is no longer included in the build.

TensorFlow build

https://github.com/ARM-software/Tool-Solutions/blob/r25.03/ML-Frameworks/tensorflow-aarch64/README.md
This release establishes a new TensorFlow build based on the approach taken with PyTorch builds.

Changed

Updated TensorFlow to 2.20.dev-6506da6

Removed

Removed MLCommons examples and patches
Removed cpp examples
Removed all inline patches; work-in-progress features should now be applied from PRs using wget in ./get-source.sh.
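
As a rough illustration of fetching a PR as a patch, GitHub serves any pull request in patch form at its pull URL with `.patch` appended. The sketch below maps an `owner/repo#number` reference, as used in these notes, to that URL. The helper name `pr_patch_url` is hypothetical, and the URLs that ./get-source.sh actually passes to wget may differ.

```python
import re

# Matches references of the form "owner/repo#123".
_PR_REF = re.compile(r"^([\w.-]+)/([\w.-]+)#(\d+)$")


def pr_patch_url(pr_ref: str) -> str:
    """Map 'owner/repo#123' to the GitHub URL serving that PR as a patch.

    Hypothetical helper for illustration; it performs no network access.
    """
    match = _PR_REF.match(pr_ref)
    if match is None:
        raise ValueError(f"not an owner/repo#number reference: {pr_ref!r}")
    owner, repo, number = match.groups()
    return f"https://github.com/{owner}/{repo}/pull/{number}.patch"


if __name__ == "__main__":
    print(pr_patch_url("uxlfoundation/oneDNN#2838"))
```

The resulting URL can be downloaded with wget and applied to a checkout with `git apply` or `patch -p1`.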

Complete list of PRs:

TensorFlow refactor: Build wheel + install in docker by @davsva01 in #293
Update openblas hash by @aditew01 in #294
Add test script for TensorFlow by @davsva01 in #295
Remove bumping of oneDNN and ACL by @davsva01 in #296
Updates PyTorch build. by @nSircombe in #297
Fixes PyTorch build. by @nSircombe in #298
Fixes mistake in OpenBLAS build by @nSircombe in #299
Fixes OpenBLAS PR5157 hash. by @nSircombe in #300
Updates for r25.03 by @nSircombe in #302
Set weights_only=False in torch.load by @nSircombe in #303
Fix broken links and incorrect docs by @davsva01 in #304
Removes static_quantize_conv example and adds fpmath_mode::bf16 Compute Library dispatch patch by @nSircombe in #305
Adds --force to git fetch command by @nSircombe in #306

Full Diff: pytorch-aarch64--r25.02...r25.03

Contributors

nSircombe, davsva01, and aditew01
