You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This PR updates the README to clarify minimum hardware architecture requirements for different data types. The documentation now specifies which instruction sets are needed for each function type (f32, bf16, int8).
Signed-off-by: Vishal <Vishal.Akula@amd.com>
| f32 (float) | AVX2/FMA3 | AMD Zen1 and newer, Intel Haswell and newer |
115
+
| bf16 (bfloat16) | AVX2/FMA3 | AMD Zen1 and newer, Intel Haswell and newer |
116
+
| ↳ | AVX512_BF16 (optimal) | AMD Zen4 and newer, Intel Cooper Lake and newer |
117
+
| int8 (int8, uint8) | AVX512_VNNI | AMD Zen4 and newer, Intel Cascade Lake and newer |
118
+
119
+
While optimized for AMD processors, the library is compatible with any x86_64 CPU that meets these minimum requirements. For best performance on AMD processors, it is recommended to use Zen4 or newer architectures which support all instruction sets.
0 commit comments