aboutsummaryrefslogtreecommitdiff
path: root/src/core/NEON/kernels/arm_gemm
AgeCommit message (Expand)Author
6 daysarm_gemm: fix SVE check on fast mode kernels.David Mansell
8 daysNew SME2 heuristics.David Mansell
10 daysDisable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...Gunes Bayir
13 daysDisable SME2 Gemm kernel selection in case results needs to be accumulatedGunes Bayir
2024-04-16fix compilation errors on linux with gcc12Sunita Nadampalli
2024-04-15Add s8f32 kernels and dynamic QuantizationInfoJonathan Deakin
2024-04-11Add in place summation to CPU GEMM kernelsRadu Salavat
2024-03-27Added new NEON fixed format fast math mode hybrid kernel with maximum height ...Milos Puzovic
2024-03-21[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
2024-03-18Fix quant. gemv kernel driver by adding set_quantized_bias()Gunes Bayir
2024-03-14arm_gemm: Fix bias handling for sme2 FP16 GEMV.David Mansell
2024-02-22Fix OpenBSDĀ® build failure caused by patch 11144Gunes Bayir
2024-02-21Integrate new pretranspose_b_array with extra fused transpose of BGunes Bayir
2024-02-06arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...David Mansell
2024-01-25arm_gemm: convolution: optimize convolver.hpp.David Mansell
2024-01-12[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.Renato Arantes
2023-11-09Remove duplicate definitions of BF16 fixed format kernels.David Mansell
2023-10-17arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.David Mansell
2023-10-17Revert "arm_gemm: Add SME2 FP16 GEMV."David Mansell
2023-10-12arm_gemm: Add SME2 FP16 GEMV.David Mansell
2023-10-10arm_gemm: fix 2D threading mode for SME2David Mansell
2023-08-29GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hppDavid Mansell
2023-08-17Fix various static check issuesViet-Hoa Do
2023-06-07Fix build error for armv7aPablo Marquez Tello
2023-06-05Update CPU kernel implementations and guard directivesMichael Tyler
2023-05-03Update a64_transpose_interleave_16.hppDavid Svantesson
2023-04-26Integrate multi-threaded pretranspose_B_arraySiCong Li
2023-03-21gemm_interleaved: Set up the accumulation buffer properly in alternateDavid Mansell
2023-03-13arm_gemm: Add SME2 FP16 kernels.David Mansell
2023-03-07GEMM: SME: Allow threading for quantized GEMMs.David Mansell
2023-02-08Update CPU kernels to remove x19 and w19Michael Tyler
2023-01-18Revert "Update CPU kernels to remove x19"Michael Tyler
2023-01-16Update CPU kernels to remove x19Michael Tyler
2022-12-28Fix company name on copyright noticeViet-Hoa Do
2022-12-28Fix various compilation errorsViet-Hoa Do
2022-11-28Integrate SME2 kernelsViet-Hoa Do
2022-09-23CPU GEMM: Fix overreads in SVE merges.David Mansell
2022-08-04[ONCPUML-970] Fast math mode for fixed format kernelsPablo Marquez Tello
2022-07-19[ONCPUML-951] Variable weight support for Convolution.Francesco Petrogalli
2022-07-18Fix Neoverse V1 heuristics for FP32 fast moderamelg01
2022-07-01Fix OpenBSD build errorsPablo Marquez Tello
2022-05-24[arm_gemm] Import fixed-format kernels from gemm_linux.Francesco.Petrogalli@arm.com
2022-05-06Updating a64_gemm_u8 a64_gemm_s8 kernels headersramelg01
2022-04-06[arm_gemm] Use static validate to find arm_gemm kernels.Francesco.Petrogalli@arm.com
2022-04-04Remove Non-Inclusive Term "Master"ramelg01
2022-03-16Remove deprecated interface from arm_compute.Francesco.Petrogalli@arm.com
2022-03-10Added windows native build supportPablo Tello
2022-01-25Add OpenBSD/arm64 support.Kevin Lo
2021-12-24Replacing non-inclusive terms with proper termsramelg01
2021-12-14Update A510 arm_gemm cpu Kernelsramelg01