aboutsummaryrefslogtreecommitdiff
path: root/src/core/NEON
AgeCommit message (Expand)Author
4 daysMake quantization rounding consistentJonathan Deakin
9 daysarm_gemm: fix SVE check on fast mode kernels.David Mansell
10 daysChange reorder implementation to be vector length agnostic for OHWIo8 reorderRadu Salavat
11 daysNew SME2 heuristics.David Mansell
13 daysDisable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...Gunes Bayir
2024-04-26Disable SME2 Gemm kernel selection in case results needs to be accumulatedGunes Bayir
2024-04-16fix compilation errors on linux with gcc12Sunita Nadampalli
2024-04-15Add s8f32 kernels and dynamic QuantizationInfoJonathan Deakin
2024-04-11Add in place summation to CPU GEMM kernelsRadu Salavat
2024-03-27Added new NEON fixed format fast math mode hybrid kernel with maximum height ...Milos Puzovic
2024-03-21[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
2024-03-18Fix quant. gemv kernel driver by adding set_quantized_bias()Gunes Bayir
2024-03-14arm_gemm: Fix bias handling for sme2 FP16 GEMV.David Mansell
2024-03-12Fix WoA nightly failurePablo Marquez Tello
2024-02-22Fix segfault in DWC in WoAPablo Marquez Tello
2024-02-22Fix OpenBSDĀ® build failure caused by patch 11144Gunes Bayir
2024-02-21Integrate new pretranspose_b_array with extra fused transpose of BGunes Bayir
2024-02-14Fix compiler errors in cl-clangPablo Marquez Tello
2024-02-07Parallelize CPU depthwise over batch if only 1 rowJonathan Deakin
2024-02-06arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...David Mansell
2024-01-25arm_gemm: convolution: optimize convolver.hpp.David Mansell
2024-01-12[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.Renato Arantes
2023-12-07Optimize CPU depth-to-spaceViet-Hoa Do
2023-12-05Optimize CpuSoftmaxKernel for axis=0Gunes Bayir
2023-11-27BatchNorm changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-16NormalizationLayer changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-09Remove duplicate definitions of BF16 fixed format kernels.David Mansell
2023-11-01Fix compilation error with clang and multi-isaViet-Hoa Do
2023-10-31Fix SVE kernel using SVE2 instructionViet-Hoa Do
2023-10-17arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.David Mansell
2023-10-17Revert "arm_gemm: Add SME2 FP16 GEMV."David Mansell
2023-10-12arm_gemm: Add SME2 FP16 GEMV.David Mansell
2023-10-10arm_gemm: fix 2D threading mode for SME2David Mansell
2023-10-10Fix NEReorderKernel validationDavid Svantesson
2023-10-10Optimize NEStackLayerGunes Bayir
2023-10-03Fix nightly NEON Reverse reference failureAdnan AlSinan
2023-09-28Revise clang-format configurationJakub Sujak
2023-09-28Reimplement erf functionViet-Hoa Do
2023-09-28Apply clang-format on repositoryFelix Thomasmathibalan
2023-09-27Implement tflite compliant reverse for CPUAdnan AlSinan
2023-09-15Fix include dependencies for mass reformatting patchGunes Bayir
2023-08-29GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hppDavid Mansell
2023-08-23Update CpuGemmConv2d and CpuFlatten to use CpuReshape operatorAnitha Raj
2023-08-22CPU: Depthwise: Generate correct size for input indirection array.David Mansell
2023-08-17Fix various static check issuesViet-Hoa Do
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-18Break up core/Utils.h to reduce unused code being included everywhereMatthew Bentham
2023-07-13Enable premultiplication for depthwise convolutionMichael Tyler
2023-07-05Fix unused function warningMichael Tyler
2023-07-04Depthwise channel pre-multiplicationMichael Tyler