Skip to content

Simd v6.2.159

Latest

Choose a tag to compare

@ermig1979 ermig1979 released this 03 Mar 09:15
· 23 commits to master since this release

Algorithms

New features
  • Base implementation, AMX-BF16 optimizations of class SynetConvolution16bNhwcSpecV2.
  • Support of SimdSynetUnaryOperation32fRound in function SynetUnaryOperation32f.
  • Support of SimdSynetUnaryOperation32fSign in function SynetUnaryOperation32f.
Bug fixing
  • Error in AMX-BF16 optimizations of class SynetConvolution16bNhwcGemmV1 (kernel Convolution16bNhwcGemm_Macro32x32).
  • Error in Base implementation of class SynetQuantizedConvolutionNhwcDepthwiseV2 (multithread using of SimdSynetQuantizedConvolutionForward).
  • Error in Base implementation of class SynetQuantizedConvolutionNhwcDepthwiseV3 (multithread using of SimdSynetQuantizedConvolutionForward).
  • Error in AMX-BF16 optimizations of class SynetConvolution16bNhwcGemmV1 (Inv2x2, kernel Convolution16bNhwcGemm_MacroNx32, unaligned dstH*dstW).
  • Error in AMX-BF16 optimizations of class SynetConvolution16bNhwcGemmV1 (Inv2x2, kernel Convolution16bNhwcGemm_MacroNx32, unaligned dstC).
  • Error in SSE4.1, AVX2, AVX-512BW, NEON optimizations of function AbsDifference (wrong alignment checking).
  • Error in AVX-512BW optimizations of class SynetConvolution32fGemmNN (case of extra large padding).
  • Error in AVX-512BW optimizations of class ResizerNearest.
  • Error in AMX-BF16 optimizations of class SynetMergedConvolution16bCdc (batch > 1, small input size).
  • Error in functions BodyH, BodyW (file SimdSynetConvParam.h).