aboutsummaryrefslogtreecommitdiff
path: root/src/core/NEON/kernels
AgeCommit message (Expand)Author
2023-08-29GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hppDavid Mansell
2023-08-23Update CpuGemmConv2d and CpuFlatten to use CpuReshape operatorAnitha Raj
2023-08-22CPU: Depthwise: Generate correct size for input indirection array.David Mansell
2023-08-17Fix various static check issuesViet-Hoa Do
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-18Break up core/Utils.h to reduce unused code being included everywhereMatthew Bentham
2023-07-13Enable premultiplication for depthwise convolutionMichael Tyler
2023-07-05Fix unused function warningMichael Tyler
2023-07-04Depthwise channel pre-multiplicationMichael Tyler
2023-06-23Fix doxygen warningsramy.elgammal@arm.com
2023-06-15Break up Utils.h a bit to reduce unused code being included everywhereMatthew Bentham
2023-06-15Break up arm_compute/core/Types.h a bitMatthew Bentham
2023-06-07Fix build error for armv7aPablo Marquez Tello
2023-06-07Fix guards for FP16 depthwise kernelsMichael Tyler
2023-06-05Update CPU kernel implementations and guard directivesMichael Tyler
2023-05-03Guards to make NEReorder aarch64 onlyDavid Svantesson
2023-05-03Update a64_transpose_interleave_16.hppDavid Svantesson
2023-04-28Reorder addedDavid Svantesson
2023-04-28Fix the gather layer indices checkViet-Hoa Do
2023-04-26Add FP16 depthwise kernels for SME2David Mansell
2023-04-26Integrate multi-threaded pretranspose_B_arraySiCong Li
2023-03-30Add cropping support to NEBatchToSpaceSiCong Li
2023-03-29Fix GCC13 compiler errorsPablo Marquez Tello
2023-03-27Fix BatchToSpaceFixtureSiCong Li
2023-03-21gemm_interleaved: Set up the accumulation buffer properly in alternateDavid Mansell
2023-03-13arm_gemm: Add SME2 FP16 kernels.David Mansell
2023-03-08Add support for arbitrary parameters for CPU GatherViet-Hoa Do
2023-03-07GEMM: SME: Allow threading for quantized GEMMs.David Mansell
2023-02-15Fix Intermittent Neon™ ReduceMean QASYMM8 MismatchMohammed Suhail Munshi
2023-02-08Update CPU kernels to remove x19 and w19Michael Tyler
2023-02-08Add support for dilation > 1 in assembly DepthwiseConvolutionPablo Marquez Tello
2023-01-31Fixed clang-cl linker errorsPablo Tello
2023-01-20Improve the strided_slice layer on all data typesOmar Al Khatib
2023-01-18Revert "Update CPU kernels to remove x19"Michael Tyler
2023-01-16Update CPU kernels to remove x19Michael Tyler
2023-01-13Remove unused code in arm_conv/depthwise/Pablo Marquez Tello
2022-12-28Fix company name on copyright noticeViet-Hoa Do
2022-12-28Fix various compilation errorsViet-Hoa Do
2022-12-21Optimize MeanReduce by integer acc. and removing upfront dequant.Omar Al Khatib
2022-11-28Integrate SME2 kernelsViet-Hoa Do
2022-09-23CPU GEMM: Fix overreads in SVE merges.David Mansell
2022-09-14INT8 Quantized MeanStdDevNorm (LayerNorm)Murray Kornelsen
2022-08-08Fix for AI benchmark ResNet regressionViet-Hoa Do
2022-08-04[ONCPUML-970] Fast math mode for fixed format kernelsPablo Marquez Tello
2022-07-19[ONCPUML-951] Variable weight support for Convolution.Francesco Petrogalli
2022-07-18Fix Neoverse V1 heuristics for FP32 fast moderamelg01
2022-07-14Integrate new winograd APIs from MLTechramelg01
2022-07-13Fixed clang-cl errors on Windows native builds.Pablo Tello
2022-07-01Fix OpenBSD build errorsPablo Marquez Tello
2022-06-13Add support for 2d and 3d indices for axis 1Pablo Marquez Tello