path: root/src/core/NEON/kernels
Age | Commit message | Author
27 hours | Added new NEON fixed format fast math mode hybrid kernel with maximum height ... [HEAD, release_candidate, main] | Milos Puzovic
7 days | [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc... | Renato Arantes
10 days | Fix quant. gemv kernel driver by adding set_quantized_bias() | Gunes Bayir
14 days | arm_gemm: Fix bias handling for sme2 FP16 GEMV. | David Mansell
2024-03-12 | Fix WoA nightly failure | Pablo Marquez Tello
2024-02-22 | Fix segfault in DWC in WoA | Pablo Marquez Tello
2024-02-22 | Fix OpenBSD® build failure caused by patch 11144 | Gunes Bayir
2024-02-21 | Integrate new pretranspose_b_array with extra fused transpose of B | Gunes Bayir
2024-02-14 | Fix compiler errors in cl-clang | Pablo Marquez Tello
2024-02-07 | Parallelize CPU depthwise over batch if only 1 row | Jonathan Deakin
2024-02-06 | arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k... | David Mansell
2024-01-25 | arm_gemm: convolution: optimize convolver.hpp. | David Mansell
2024-01-12 | [ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion. | Renato Arantes
2023-12-07 | Optimize CPU depth-to-space | Viet-Hoa Do
2023-11-27 | BatchNorm changes to enable fp16 in armv8a multi_isa builds | Pablo Marquez Tello
2023-11-16 | NormalizationLayer changes to enable fp16 in armv8a multi_isa builds | Pablo Marquez Tello
2023-11-09 | Remove duplicate definitions of BF16 fixed format kernels. | David Mansell
2023-10-17 | arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product. | David Mansell
2023-10-17 | Revert "arm_gemm: Add SME2 FP16 GEMV." | David Mansell
2023-10-12 | arm_gemm: Add SME2 FP16 GEMV. | David Mansell
2023-10-10 | arm_gemm: fix 2D threading mode for SME2 | David Mansell
2023-10-10 | Fix NEReorderKernel validation | David Svantesson
2023-10-10 | Optimize NEStackLayer | Gunes Bayir
2023-10-03 | Fix nightly NEON Reverse reference failure | Adnan AlSinan
2023-09-28 | Apply clang-format on repository | Felix Thomasmathibalan
2023-09-27 | Implement tflite compliant reverse for CPU | Adnan AlSinan
2023-09-15 | Fix include dependencies for mass reformatting patch | Gunes Bayir
2023-08-29 | GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hpp | David Mansell
2023-08-23 | Update CpuGemmConv2d and CpuFlatten to use CpuReshape operator | Anitha Raj
2023-08-22 | CPU: Depthwise: Generate correct size for input indirection array. | David Mansell
2023-08-17 | Fix various static check issues | Viet-Hoa Do
2023-07-28 | Retain back-compatibility for arm_compute/core/Types.h | SiCong Li
2023-07-18 | Break up core/Utils.h to reduce unused code being included everywhere | Matthew Bentham
2023-07-13 | Enable premultiplication for depthwise convolution | Michael Tyler
2023-07-05 | Fix unused function warning | Michael Tyler
2023-07-04 | Depthwise channel pre-multiplication | Michael Tyler
2023-06-23 | Fix doxygen warnings | ramy.elgammal@arm.com
2023-06-15 | Break up Utils.h a bit to reduce unused code being included everywhere | Matthew Bentham
2023-06-15 | Break up arm_compute/core/Types.h a bit | Matthew Bentham
2023-06-07 | Fix build error for armv7a | Pablo Marquez Tello
2023-06-07 | Fix guards for FP16 depthwise kernels | Michael Tyler
2023-06-05 | Update CPU kernel implementations and guard directives | Michael Tyler
2023-05-03 | Guards to make NEReorder aarch64 only | David Svantesson
2023-05-03 | Update a64_transpose_interleave_16.hpp | David Svantesson
2023-04-28 | Reorder added | David Svantesson
2023-04-28 | Fix the gather layer indices check | Viet-Hoa Do
2023-04-26 | Add FP16 depthwise kernels for SME2 | David Mansell
2023-04-26 | Integrate multi-threaded pretranspose_B_array | SiCong Li
2023-03-30 | Add cropping support to NEBatchToSpace | SiCong Li
2023-03-29 | Fix GCC13 compiler errors | Pablo Marquez Tello