Skip to content

Simd v5.2.122

Choose a tag to compare

@ermig1979 ermig1979 released this 01 Feb 14:08
· 2122 commits to master since this release

Algorithms

New features
  • New API of function Avx512bw::TileZero (AMX emulation).
  • New API of function Avx512bw::TileLoad (AMX emulation).
  • New API of function Avx512bw::TileStore (AMX emulation).
  • New API of function Avx512bw::TileMatMulBf16 (AMX emulation).
  • New API of function Avx512bw::TileMatMul8u8i (AMX emulation).
  • Function Avx512bw::TileMatMulFp16(AMX emulation).
  • The mark of function SimdInterferenceIncrement as deprecated.
  • The mark of function SimdInterferenceIncrementMasked as deprecated.
  • The mark of function SimdInterferenceDecrement as deprecated.
  • The mark of function SimdInterferenceDecrementMasked as deprecated.
  • The mark of function SimdSynetReorderImage as deprecated.
  • The mark of function SimdSynetReorderFilter as deprecated.
  • SimdTensorData16f (16-bit floating point) tensor type.
  • The mark of function SimdSynetSpecifyTensorFormat as deprecated.
  • The mark of function SimdSynetTensorAlignment as deprecated.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, NEON optimizations of SynetPermute engine.
  • NEON optimizations of function Yuva444pToBgraV2.
  • NEON optimizations of function AlphaBlending2x.
  • SSE4.1, AVX2, AVX-512BW optimizations of function BgraToYuv444pV2.
  • SSE4.1, AVX2, AVX-512BW optimizations of function BgraToYuv420pV2.
  • Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function AlphaBlendingBgraToYuv420p.
  • Parameter 'copy' to View::Capure.
  • Method Array::Release.
Improving
  • NEON optimizations of function TransformImage.
Bug fixing
  • Clang compiler error in function Simd::WarpAffine.
  • MSVS-2022 compiler warnings in file SimdBaseRecursiveBilateralFilter.cpp.
  • MSVS-2022 compiler warnings in file SimdSse41RecursiveBilateralFilter.cpp.
  • MSVS-2015 compiler error in file SimdAvx2RecursiveBilateralFilter.cpp.
  • Error in method MergConvParam32f::Valid.
  • Crash in constructor of Simd::TileConf.
  • Crash in AVX and AVX2 optimizations of function SynetInnerProductLayerForward.
  • MSVS-2022 compiler error in file SimdAvx2RecursiveBilateralFilter.cpp (Win32 target).
  • GCC compiler error in file SimdParallel.hpp (for AVX2 optimizations).

Test framework

New features
  • Tests for verifying functionality of SynetPermut engine.
  • Tests for verifying functionality of function AlphaBlendingBgraToYuv420p.
Bug fixing
  • Crash in test GaussianBlurAutoTest.

Infrastructure

New features
  • Install target in Cmake.
  • Uninstall target in Cmake.
Renaming
  • Project Amx to AmxBf16.