path: root/src/core/NEON/kernels/arm_gemm
Age | Commit message | Author
6 days | Refactor arm_gemm to enable FP16 in all multi_isa builds | Pablo Marquez Tello
2024-05-03 | arm_gemm: fix SVE check on fast mode kernels. | David Mansell
2024-05-01 | New SME2 heuristics. | David Mansell
2024-04-29 | Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu... | Gunes Bayir
2024-04-26 | Disable SME2 Gemm kernel selection in case results needs to be accumulated | Gunes Bayir
2024-04-16 | fix compilation errors on linux with gcc12 | Sunita Nadampalli
2024-04-15 | Add s8f32 kernels and dynamic QuantizationInfo | Jonathan Deakin
2024-04-11 | Add in place summation to CPU GEMM kernels | Radu Salavat
2024-03-27 | Added new NEON fixed format fast math mode hybrid kernel with maximum height ... | Milos Puzovic
2024-03-21 | [ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc... | Renato Arantes
2024-03-18 | Fix quant. gemv kernel driver by adding set_quantized_bias() | Gunes Bayir
2024-03-14 | arm_gemm: Fix bias handling for sme2 FP16 GEMV. | David Mansell
2024-02-22 | Fix OpenBSD® build failure caused by patch 11144 | Gunes Bayir
2024-02-21 | Integrate new pretranspose_b_array with extra fused transpose of B | Gunes Bayir
2024-02-06 | arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k... | David Mansell
2024-01-25 | arm_gemm: convolution: optimize convolver.hpp. | David Mansell
2024-01-12 | [ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion. | Renato Arantes
2023-11-09 | Remove duplicate definitions of BF16 fixed format kernels. | David Mansell
2023-10-17 | arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product. | David Mansell
2023-10-17 | Revert "arm_gemm: Add SME2 FP16 GEMV." | David Mansell
2023-10-12 | arm_gemm: Add SME2 FP16 GEMV. | David Mansell
2023-10-10 | arm_gemm: fix 2D threading mode for SME2 | David Mansell
2023-08-29 | GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hpp | David Mansell
2023-08-17 | Fix various static check issues | Viet-Hoa Do
2023-06-07 | Fix build error for armv7a | Pablo Marquez Tello
2023-06-05 | Update CPU kernel implementations and guard directives | Michael Tyler
2023-05-03 | Update a64_transpose_interleave_16.hpp | David Svantesson
2023-04-26 | Integrate multi-threaded pretranspose_B_array | SiCong Li
2023-03-21 | gemm_interleaved: Set up the accumulation buffer properly in alternate | David Mansell
2023-03-13 | arm_gemm: Add SME2 FP16 kernels. | David Mansell
2023-03-07 | GEMM: SME: Allow threading for quantized GEMMs. | David Mansell
2023-02-08 | Update CPU kernels to remove x19 and w19 | Michael Tyler
2023-01-18 | Revert "Update CPU kernels to remove x19" | Michael Tyler
2023-01-16 | Update CPU kernels to remove x19 | Michael Tyler
2022-12-28 | Fix company name on copyright notice | Viet-Hoa Do
2022-12-28 | Fix various compilation errors | Viet-Hoa Do
2022-11-28 | Integrate SME2 kernels | Viet-Hoa Do
2022-09-23 | CPU GEMM: Fix overreads in SVE merges. | David Mansell
2022-08-04 | [ONCPUML-970] Fast math mode for fixed format kernels | Pablo Marquez Tello
2022-07-19 | [ONCPUML-951] Variable weight support for Convolution. | Francesco Petrogalli
2022-07-18 | Fix Neoverse V1 heuristics for FP32 fast mode | ramelg01
2022-07-01 | Fix OpenBSD build errors | Pablo Marquez Tello
2022-05-24 | [arm_gemm] Import fixed-format kernels from gemm_linux. | Francesco.Petrogalli@arm.com
2022-05-06 | Updating a64_gemm_u8 a64_gemm_s8 kernels headers | ramelg01
2022-04-06 | [arm_gemm] Use static validate to find arm_gemm kernels. | Francesco.Petrogalli@arm.com
2022-04-04 | Remove Non-Inclusive Term "Master" | ramelg01
2022-03-16 | Remove deprecated interface from arm_compute. | Francesco.Petrogalli@arm.com
2022-03-10 | Added windows native build support | Pablo Tello
2022-01-25 | Add OpenBSD/arm64 support. | Kevin Lo
2021-12-24 | Replacing non-inclusive terms with proper terms | ramelg01