aboutsummaryrefslogtreecommitdiff
path: root/src/cpu/kernels
AgeCommit message (Expand)Author
2023-09-13Changes in NECropResize to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-09-08Meanstddevnorm changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-09-06Changes to BoundingBoxTransform to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-09-06Changes to ElementwiseOp to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-09-04DWC changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-09-01Pool3d changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-08-30Changes in roi_align to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-08-29NEFuseBatchNormalizationKernel reworkPablo Marquez Tello
2023-08-23CpuAdd rework to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-08-22Optimize CpuReshapeKernelAnitha Raj
2023-08-17Fix depthwise convolution not using assembly kernelViet-Hoa Do
2023-08-08Add support for S64 output in NEArgMinMaxLayerPablo Marquez Tello
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-19Add support for input S64/U64 in CpuCastKernelPablo Marquez Tello
2023-07-18Break up core/Utils.h to reduce unused code being included everywhereMatthew Bentham
2023-07-04Depthwise channel pre-multiplicationMichael Tyler
2023-06-26Remove dependency on fp16 definitions from some core include filesMatthew Bentham
2023-06-21Fix CPU depthwise convolution in case of large paddingViet-Hoa Do
2023-06-15Break up Utils.h a bit to reduce unused code being included everywhereMatthew Bentham
2023-06-15Break up arm_compute/core/Types.h a bitMatthew Bentham
2023-06-12Refactor activation LUT computationPablo Marquez Tello
2023-06-06Fix ScaleKernel validate method.Pablo Marquez Tello
2023-05-17Move lut kernel to sve2 categorySiCong Li
2023-05-10Re-enable dyanmic weights in Neon™ depthwise convolutionRamy Elgammal
2023-05-03Fix im2col for fast-maths mode with padding.Renato Arantes
2023-04-26Integrate multi-threaded pretranspose_B_arraySiCong Li
2023-04-19NETranspose 8x8 kernel for 32-bit elementsEthan Doe
2023-04-17Fix unhandled case in ElementwiseUnaryRamy Elgammal
2023-04-11Enable quantized data types for CpuElementwiseUnary on Armv7aRamy Elgammal
2023-03-29Fix GCC13 compiler errorsPablo Marquez Tello
2023-03-29Add quantized support for unary elementwise in CPUViet-Hoa Do
2023-03-23Round to nearest with ties to away from zero in ReluPablo Marquez Tello
2023-03-01Add support for kernel indices in MaxpoolAdnan AlSinan
2023-02-28Add an option to use lowest for max-poolingAdnan AlSinan
2023-02-08Update CPU kernels to remove x19 and w19Michael Tyler
2023-02-08Add support for dilation > 1 in assembly DepthwiseConvolutionPablo Marquez Tello
2023-02-03Fix armv7a failing GEMMConvolutionLayer testsMohammed Suhail Munshi
2023-02-01Fix GEMMLowp/Batched MatMul mismatches on CPUMohammed Suhail Munshi
2023-02-01Add new operator AddMulAdd for Neon™ backend for Float/Quantized typesGunes Bayir
2023-01-18Revert "Update CPU kernels to remove x19"Michael Tyler
2023-01-16Update CPU kernels to remove x19Michael Tyler
2023-01-11Deprecated BF16 support in DepthConvertPablo Marquez Tello
2022-12-29Use CPU quantized addition kernel for quantized subtractionOmar Al Khatib
2022-12-21Fixed various mismatches in CpuCastKernelPablo Marquez Tello
2022-11-30Fix build error for unused variables in data type specific buildsGunes Bayir
2022-11-23ONCPUML-1072: Remove double definition of get_mws for Mul kernelfadara01
2022-11-22ONCPUML-1072: Tuned MWS values (for N1, V1) for binary operators used by oneDNNFadi Arafeh
2022-11-15Fix regression caused by mws in ActivationLayerMohammed Suhail Munshi
2022-11-15Fixed Arm NN unit test failure caused by quantised multiplication patch.Omar Al Khatib
2022-11-09Fix CPU multiplication layer threading overheadViet-Hoa Do