1 file changed: +4 -0 lines changed
* Improved `fp16` reorder performance.
* Improved `int8` matmul performance.
* Improved `bf16` inner product forward propagation performance with Arm Compute Library (ACL).
+ * Improved `bf16` eltwise performance.
* Improved convolution performance on processors with SVE support with ACL.
# Functionality
* Enabled support for ROCm 6 on AMD GPUs.
* Improved CMake integration for oneDNN installation with Nvidia backend enabled.
* Reduced memory footprint for matmul primitive when using ACL.
+ * Fixed correctness issues in `fp32` reorders when using ACL.
+ * Fixed sporadic crash in matmuls when using ACL with threadpool scheduler.
+ * Fixed sporadic correctness issue in convolutions with ACL.
[`ONEDNN_ENABLE_PRIMITIVE`]: https://uxlfoundation.github.io/oneDNN/v3.8/dev_guide_build_options.html#onednn-enable-primitive