Skip to content

Simd v5.0.115

Compare
Choose a tag to compare
@ermig1979 ermig1979 released this 01 Jul 16:22
· 996 commits to master since this release

Algorithms

New features
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX optimizations of class SynetMergedConvolution32fBf16Cdc.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX optimizations of class SynetMergedConvolution32fBf16Cd.
  • Base implementation, SSE4.1, AVX2, AVX-512BW, AVX-512BF16, AMX optimizations of class SynetMergedConvolution32fBf16Dc.
  • AVX-512BF16 extension support.
  • AVX-512BF16 optimizations of function Float32ToBFloat16.
  • AVX-512BF16, AMX optimizations of class SynetConvolution32fBf16Nhwc.
  • AMX extension support.
  • Support of 3D pooling in Base implementation, SSE4.1, AVX2, AVX-512BW optimizations of function SynetPoolingMax32f.
Improving
  • AVX-512BW optimizations of function Fill32f.
Renaming
  • Rename function SynetPoolingForwardAverage to SynetPoolingAverage.
  • Rename function SynetPoolingForwardMax32f to SynetPoolingMax32f.
  • Rename function SynetPoolingForwardMax8u to SynetPoolingMax8u.
Replacing
  • Replace AVX-512F optimizations to AVX-512BW for function SvmSumLinear.
  • Replace AVX-512F optimizations to AVX-512BW for function Fill32f.
  • Replace AVX-512F optimizations to AVX-512BW for class ResizerNearest.
  • Replace AVX-512F optimizations to AVX-512BW for class ResizerFloatBilinear.
  • Replace AVX-512F optimizations to AVX-512BW for function SquaredDifferenceSum32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SquaredDifferenceKahanSum32f.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralConvolutionForward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution2x2Forward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution2x2Backward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution2x2Sum.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution3x3Forward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution3x3Backward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution3x3Sum.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution4x4Forward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution4x4Backward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution4x4Sum.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution5x5Forward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution5x5Backward.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddConvolution5x5Sum.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralProductSum.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAdaptiveGradientUpdate.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralPooling1x1Max3x3.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralPooling2x2Max2x2.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralPooling2x2Max3x3.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralUpdateWeights.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddValue.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddVector.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralAddVectorMultipliedByValue.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralRoughSigmoid.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralRoughSigmoid2.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralDerivativeSigmoid.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralRoughTanh.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralDerivativeTanh.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralDerivativeRelu.
  • Replace AVX-512F optimizations to AVX-512BW for function NeuralPow.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fGemmNN.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fGemmNT.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fWinograd.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetDeconvolution32fGemmNN.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetDeconvolution32fNhwcDirect2x2.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetDeconvolution32fInit.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetInnerProduct32fGemm.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetInnerProduct32fProd.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetInnerProduct32fInit.
  • Replace AVX-512F optimizations to AVX-512BW for function ConvolutionBiasAndActivation.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetReorderImage.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetReorderFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function Gemm32fNN.
  • Replace AVX-512F optimizations to AVX-512BW for function Gemm32fNT.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward0.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward1.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward2.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward3.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward4.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward8.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetFusedLayerForward9.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x3Block1x4SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x3Block1x4SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x3Block1x4SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x5Block1x4SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x5Block1x4SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel1x5Block1x4SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block2x2SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block2x2SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block2x2SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block4x4SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block4x4SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel2x2Block4x4SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block2x2SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block2x2SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block2x2SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block3x3SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block3x3SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block3x3SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block4x4SetFilter.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block4x4SetInput.
  • Replace AVX-512F optimizations to AVX-512BW for function WinogradKernel3x3Block4x4SetOutput.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetElu32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetHardSigmoid32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetHswish32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetMish32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetPreluLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetRelu32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetRestrictRange32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetSigmoid32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetSoftplus32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetSwish32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetTanh32f.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetScaleLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetPoolingAverage.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetAddBias.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetEltwiseLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetInnerProductLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetLrnLayerCrossChannels.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetShuffleLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetSoftmaxLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for function SynetUnaryOperation32fLayerForward.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fDirectNchw.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fDirectNhwc.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetConvolution32fNhwcDirect.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetMergedConvolution32fCdc.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetMergedConvolution32fCd.
  • Replace AVX-512F optimizations to AVX-512BW for class SynetMergedConvolution32fDc.

Infrastructure

New features
  • Project Avx512bf16 for Microsoft Visual Studio 2022.
  • Project Avx512bf16 for Microsoft Visual Studio 2019.
  • Project Amx for Microsoft Visual Studio 2022.
  • Project Amx for Microsoft Visual Studio 2019.
Removing
  • Project Avx512f for Microsoft Visual Studio 2022.
  • Project Avx512f for Microsoft Visual Studio 2019.