path: root/src/core/NEON
Age | Commit message | Author
2 days | fix compilation errors on linux with gcc12 | Sunita Nadampalli
3 days | Add s8f32 kernels and dynamic QuantizationInfo | Jonathan Deakin
7 days | Add in place summation to CPU GEMM kernels | Radu Salavat
2024-03-27 | Added new NEON fixed format fast math mode hybrid kernel with maximum height ... | Milos Puzovic
2024-03-21 | [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc... | Renato Arantes
2024-03-18 | Fix quant. gemv kernel driver by adding set_quantized_bias() | Gunes Bayir
2024-03-14 | arm_gemm: Fix bias handling for sme2 FP16 GEMV. | David Mansell
2024-03-12 | Fix WoA nightly failure | Pablo Marquez Tello
2024-02-22 | Fix segfault in DWC in WoA | Pablo Marquez Tello
2024-02-22 | Fix OpenBSD® build failure caused by patch 11144 | Gunes Bayir
2024-02-21 | Integrate new pretranspose_b_array with extra fused transpose of B | Gunes Bayir
2024-02-14 | Fix compiler errors in cl-clang | Pablo Marquez Tello
2024-02-07 | Parallelize CPU depthwise over batch if only 1 row | Jonathan Deakin
2024-02-06 | arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k... | David Mansell
2024-01-25 | arm_gemm: convolution: optimize convolver.hpp. | David Mansell
2024-01-12 | [ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion. | Renato Arantes
2023-12-07 | Optimize CPU depth-to-space | Viet-Hoa Do
2023-12-05 | Optimize CpuSoftmaxKernel for axis=0 | Gunes Bayir
2023-11-27 | BatchNorm changes to enable fp16 in armv8a multi_isa builds | Pablo Marquez Tello
2023-11-16 | NormalizationLayer changes to enable fp16 in armv8a multi_isa builds | Pablo Marquez Tello
2023-11-09 | Remove duplicate definitions of BF16 fixed format kernels. | David Mansell
2023-11-01 | Fix compilation error with clang and multi-isa | Viet-Hoa Do
2023-10-31 | Fix SVE kernel using SVE2 instruction | Viet-Hoa Do
2023-10-17 | arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product. | David Mansell
2023-10-17 | Revert "arm_gemm: Add SME2 FP16 GEMV." | David Mansell
2023-10-12 | arm_gemm: Add SME2 FP16 GEMV. | David Mansell
2023-10-10 | arm_gemm: fix 2D threading mode for SME2 | David Mansell
2023-10-10 | Fix NEReorderKernel validation | David Svantesson
2023-10-10 | Optimize NEStackLayer | Gunes Bayir
2023-10-03 | Fix nightly NEON Reverse reference failure | Adnan AlSinan
2023-09-28 | Revise clang-format configuration | Jakub Sujak
2023-09-28 | Reimplement erf function | Viet-Hoa Do
2023-09-28 | Apply clang-format on repository | Felix Thomasmathibalan
2023-09-27 | Implement tflite compliant reverse for CPU | Adnan AlSinan
2023-09-15 | Fix include dependencies for mass reformatting patch | Gunes Bayir
2023-08-29 | GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hpp | David Mansell
2023-08-23 | Update CpuGemmConv2d and CpuFlatten to use CpuReshape operator | Anitha Raj
2023-08-22 | CPU: Depthwise: Generate correct size for input indirection array. | David Mansell
2023-08-17 | Fix various static check issues | Viet-Hoa Do
2023-07-28 | Retain back-compatibility for arm_compute/core/Types.h | SiCong Li
2023-07-18 | Break up core/Utils.h to reduce unused code being included everywhere | Matthew Bentham
2023-07-13 | Enable premultiplication for depthwise convolution | Michael Tyler
2023-07-05 | Fix unused function warning | Michael Tyler
2023-07-04 | Depthwise channel pre-multiplication | Michael Tyler
2023-06-23 | Fix doxygen warnings | ramy.elgammal@arm.com
2023-06-21 | Enable vmfa in arm7va/aarch32 when present | Pablo Marquez Tello
2023-06-15 | Break up Utils.h a bit to reduce unused code being included everywhere | Matthew Bentham
2023-06-15 | Break up arm_compute/core/Types.h a bit | Matthew Bentham
2023-06-07 | Fix build error for armv7a | Pablo Marquez Tello
2023-06-07 | Fix guards for FP16 depthwise kernels | Michael Tyler