Skip to content

Simd v6.2.152

Choose a tag to compare

@ermig1979 ermig1979 released this 01 Aug 08:24
· 373 commits to master since this release

Algorithms

New features
  • AVX2, AVX-512BW optimizations of class SynetQuantizedAddUniform.
  • Base implementation of class SynetQuantizedInnerProductRef.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedInnerProductGemmNN.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512VNNI, AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcSpecV0.
  • Base implementation, SSE4.1, AVX2 optimizations of class SynetQuantizedConvolutionNhwcDepthwise.
Improve
  • AMX-INT8 optimizations of class SynetQuantizedConvolutionNhwcGemm.
Bug fixing
  • Error in NEON optimization of function Float32ToBFloat16.
  • Error in Base implementation of class SynetQuantizedConvolutionNhwcGemm.
  • Error in Base implementation of class SynetQuantizedConvolutionGemm.

Test framework

New features
  • Tests for verifying functionality of SynetQuantizedInnerProduct framework.