path: root/src/cpu/kernels
Age | Commit message | Author
2023-04-26 | Integrate multi-threaded pretranspose_B_array | SiCong Li
2023-04-19 | NETranspose 8x8 kernel for 32-bit elements | Ethan Doe
2023-04-17 | Fix unhandled case in ElementwiseUnary | Ramy Elgammal
2023-04-11 | Enable quantized data types for CpuElementwiseUnary on Armv7a | Ramy Elgammal
2023-03-29 | Fix GCC13 compiler errors | Pablo Marquez Tello
2023-03-29 | Add quantized support for unary elementwise in CPU | Viet-Hoa Do
2023-03-23 | Round to nearest with ties to away from zero in Relu | Pablo Marquez Tello
2023-03-01 | Add support for kernel indices in Maxpool | Adnan AlSinan
2023-02-28 | Add an option to use lowest for max-pooling | Adnan AlSinan
2023-02-08 | Update CPU kernels to remove x19 and w19 | Michael Tyler
2023-02-08 | Add support for dilation > 1 in assembly DepthwiseConvolution | Pablo Marquez Tello
2023-02-03 | Fix armv7a failing GEMMConvolutionLayer tests | Mohammed Suhail Munshi
2023-02-01 | Fix GEMMLowp/Batched MatMul mismatches on CPU | Mohammed Suhail Munshi
2023-02-01 | Add new operator AddMulAdd for Neon™ backend for Float/Quantized types | Gunes Bayir
2023-01-18 | Revert "Update CPU kernels to remove x19" | Michael Tyler
2023-01-16 | Update CPU kernels to remove x19 | Michael Tyler
2023-01-11 | Deprecated BF16 support in DepthConvert | Pablo Marquez Tello
2022-12-29 | Use CPU quantized addition kernel for quantized subtraction | Omar Al Khatib
2022-12-21 | Fixed various mismatches in CpuCastKernel | Pablo Marquez Tello
2022-11-30 | Fix build error for unused variables in data type specific builds | Gunes Bayir
2022-11-23 | ONCPUML-1072: Remove double definition of get_mws for Mul kernel | fadara01
2022-11-22 | ONCPUML-1072: Tuned MWS values (for N1, V1) for binary operators used by oneDNN | Fadi Arafeh
2022-11-15 | Fix regression caused by mws in ActivationLayer | Mohammed Suhail Munshi
2022-11-15 | Fixed Arm NN unit test failure caused by quantised multiplication patch. | Omar Al Khatib
2022-11-09 | Fix CPU multiplication layer threading overhead | Viet-Hoa Do
2022-11-08 | SVE Hard-Swish via Lookup table for quantized input | Pablo Marquez Tello
2022-11-07 | Optimize CPU mul layer on quantized data | Omar Al Khatib
2022-11-01 | Fix fixed-point quantized addition | Viet-Hoa Do
2022-11-01 | Add threshold for floating-point SOFT_RELU activation | Milos Puzovic
2022-10-27 | Fix fixed-point quantized addition | Viet-Hoa Do
2022-10-20 | Update reinterpret tensor as 1D for CPU add | Viet-Hoa Do
2022-10-20 | Add test in GEMMLowp for batch matmul | Mohammed Suhail Munshi
2022-10-19 | Fix FFTConvolutionLayer test | Viet-Hoa Do
2022-10-12 | Optimize Neon™ Logistic Activation | Mohammed Suhail Munshi
2022-10-10 | Fix LUT-based activation layer | Viet-Hoa Do
2022-10-07 | Optimize Neon™ SUB operator by squashing execution window | Jakub Sujak
2022-10-03 | Fix Batch Matmul nightly failure | Adnan AlSinan
2022-10-03 | Optimize CPU add layer on quantized data | Viet-Hoa Do
2022-09-26 | Add FP32 Neon™ swish activation | Jonathan Deakin
2022-09-22 | Fix unresolved symbol for target armv7a + Android | Pablo Marquez Tello
2022-09-16 | Fix bug in QASYMM8_SIGNED to F32 cast layer | Viet-Hoa Do
2022-09-16 | Optimize Quantized/Integer Bilinear Scale for Neon™ | Gunes Bayir
2022-09-14 | Interpreting tensor as 1D for CPU multiplication | Viet-Hoa Do
2022-09-14 | Adding GELU activation | Murray Kornelsen
2022-09-14 | INT8 Quantized MeanStdDevNorm (LayerNorm) | Murray Kornelsen
2022-09-09 | Optimize FP32/16 Bilinear Scale Kernel for Neon™ | Gunes Bayir
2022-09-02 | F16 Specialization for MeanStdDevNorm | Murray Kornelsen
2022-08-24 | Fix add for tensors with non-matching strides | Jonathan Deakin
2022-08-18 | Use Neon™ kernels for FP Bilinear Resize for SVE | Gunes Bayir
2022-08-17 | Add LUT for quantized sigmoid function | Viet-Hoa Do