You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Replace google-perftool bundle with specific libtcmalloc-minimal4 and add symlink so that you can still use it with `LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4`.
45
-
- Remove unused git apt package.
46
-
- Remove outdated versions of pip/setuptools/wheel from image after pip updates itself.
47
-
- Remove manual install of transformers in `transformers_llm_text_gen.py`, we now use a single version everywhere.
- Replace google-perftool bundle with specific libtcmalloc-minimal4 and add symlink so that you can still use it with `LD_PRELOAD=/usr/lib/aarch64-linux-gnu/libtcmalloc.so.4`.
59
+
- Remove unused git apt package.
60
+
- Remove outdated versions of pip/setuptools/wheel from image after pip updates itself.
61
+
- Remove manual install of transformers in `transformers_llm_text_gen.py`, we now use a single version everywhere.
- Adds https://github.com/pytorch/pytorch/pull/159859, a WIP LUT implmentation of bf16 GELU
57
-
this gives an ~8x speedup on GELU and an ~1.8x speedup for attention for llama3.2 11B Vision (both on 16 threads).
58
-
- Adds https://github.com/pytorch/pytorch/pull/158250, to integrate INT4->BF16 via KleidiAI, with fallback.
59
-
- Adds https://github.com/pytorch/pytorch/pull/160080, a VLA PoC for PyTorch, and
60
-
https://github.com/pytorch/pytorch/pull/161049, an optimised SVE exp_u20 implementation,
61
-
Note: there may be some regressions on Neoverse-V1 with this WIP patch.
62
-
- Adds a new example script llama_vision_instruct.py to run and benchmark
63
-
Llama-3.2-11B-Vision-Instruct using text + image input and text output.
70
+
- Adds https://github.com/pytorch/pytorch/pull/159859, a WIP LUT implmentation of bf16 GELU
71
+
this gives an ~8x speedup on GELU and an ~1.8x speedup for attention for llama3.2 11B Vision (both on 16 threads).
72
+
- Adds https://github.com/pytorch/pytorch/pull/158250, to integrate INT4->BF16 via KleidiAI, with fallback.
73
+
- Adds https://github.com/pytorch/pytorch/pull/160080, a VLA PoC for PyTorch, and
74
+
https://github.com/pytorch/pytorch/pull/161049, an optimised SVE exp_u20 implementation,
75
+
Note: there may be some regressions on Neoverse-V1 with this WIP patch.
76
+
- Adds a new example script llama_vision_instruct.py to run and benchmark
77
+
Llama-3.2-11B-Vision-Instruct using text + image input and text output.
64
78
65
79
### Changed
66
-
- Updates hashes for:
67
-
- PYTORCH_HASH to 4e2ddb5db67617f9f5309c8bba0c17adc84cadbc, 2.9.0.dev20250808 from viable/strict, August 8th.
68
-
- IDEEP_HASH to 3527b0bf2127aa2de93810feb6906d173c24037f, from ideep_pytorch, August 1st.
69
-
- ONEDNN_HASH to 7e85b94b5f6be27b83c5435603ab67888b99da32, from main, August 1st.
70
-
- ACL_HASH to 3c32d706d0245dcb55181c8ced526eab05e2ff8d, from main, August 1st.
71
-
- TORCH_AO_HASH to 8d4a5d83d7be4d7807feabe38d37704c92d40900, from main, August 1st.
72
-
- KLEIDIAI_HASH to 8ca226712975f24f13f71d04cda039a0ee9f9e2f, v1.12 from main.
73
-
- Update the examples/transformers_llm_text_gen.py to use the new quantizer api Int8DynamicActivationIntxWeightConfig.
74
-
- Deleted torchchat_llm_text_gen.py.
75
-
- Removed Dockerfile lines cloning TorchChat repo and setting safe.directory.
76
-
- Updates huggingface_hub to 0.34.0.
80
+
- Updates hashes for:
81
+
- PYTORCH_HASH to 4e2ddb5db67617f9f5309c8bba0c17adc84cadbc, 2.9.0.dev20250808 from viable/strict, August 8th.
82
+
- IDEEP_HASH to 3527b0bf2127aa2de93810feb6906d173c24037f, from ideep_pytorch, August 1st.
83
+
- ONEDNN_HASH to 7e85b94b5f6be27b83c5435603ab67888b99da32, from main, August 1st.
84
+
- ACL_HASH to 3c32d706d0245dcb55181c8ced526eab05e2ff8d, from main, August 1st.
85
+
- TORCH_AO_HASH to 8d4a5d83d7be4d7807feabe38d37704c92d40900, from main, August 1st.
86
+
- KLEIDIAI_HASH to 8ca226712975f24f13f71d04cda039a0ee9f9e2f, v1.12 from main.
87
+
- Update the examples/transformers_llm_text_gen.py to use the new quantizer api Int8DynamicActivationIntxWeightConfig.
88
+
- Deleted torchchat_llm_text_gen.py.
89
+
- Removed Dockerfile lines cloning TorchChat repo and setting safe.directory.
90
+
- Updates huggingface_hub to 0.34.0.
77
91
78
92
### Removed
79
-
-https://github.com/pytorch/pytorch/pull/151547, to update OpenBLAS commit as this has been merged upstream.
80
-
- Temporarily removed --compile flag from transformers_llm_text_gen.py example as there is an unresolved issue with torch compile support in this case.
93
+
-https://github.com/pytorch/pytorch/pull/151547, to update OpenBLAS commit as this has been merged upstream.
94
+
- Temporarily removed --compile flag from transformers_llm_text_gen.py example as there is an unresolved issue with torch compile support in this case.
81
95
82
96
### Fixed
83
-
- Updates various Python packages to address known vulnerabilies with a high CVSS score:
84
-
- Updates Transformers to 4.55.2, this also provides a mitigation for CVE-2025-2099.
85
-
- Updates Wheel version to 0.38.0 as a mitigation for CVE-2022-40898.
86
-
- Updates setup-tools to 78.1.1 as a mitigation for CVE-2025-47273 and CVE-2024-6345.
87
-
- Updates Torchvison to 0.23 to avoid the need to use `--extra-index-url`
88
-
this is the recomended mitgation against CVE-2018-20225, affecting all versions of pip.
97
+
- Updates various Python packages to address known vulnerabilies with a high CVSS score:
98
+
- Updates Transformers to 4.55.2, this also provides a mitigation for CVE-2025-2099.
99
+
- Updates Wheel version to 0.38.0 as a mitigation for CVE-2022-40898.
100
+
- Updates setup-tools to 78.1.1 as a mitigation for CVE-2025-47273 and CVE-2024-6345.
101
+
- Updates Torchvison to 0.23 to avoid the need to use `--extra-index-url`
102
+
this is the recomended mitgation against CVE-2018-20225, affecting all versions of pip.
0 commit comments