path: root/src/core/NEON/kernels/arm_gemm
Age | Commit message | Author
2023-11-09 | Remove duplicate definitions of BF16 fixed format kernels. | David Mansell
2023-10-17 | arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product. | David Mansell
2023-10-17 | Revert "arm_gemm: Add SME2 FP16 GEMV." | David Mansell
2023-10-12 | arm_gemm: Add SME2 FP16 GEMV. | David Mansell
2023-10-10 | arm_gemm: fix 2D threading mode for SME2 | David Mansell
2023-08-29 | GEMM: AArch32: Split assembler block in a32_merge_float_8x6.hpp | David Mansell
2023-08-17 | Fix various static check issues | Viet-Hoa Do
2023-06-07 | Fix build error for armv7a | Pablo Marquez Tello
2023-06-05 | Update CPU kernel implementations and guard directives | Michael Tyler
2023-05-03 | Update a64_transpose_interleave_16.hpp | David Svantesson
2023-04-26 | Integrate multi-threaded pretranspose_B_array | SiCong Li
2023-03-21 | gemm_interleaved: Set up the accumulation buffer properly in alternate | David Mansell
2023-03-13 | arm_gemm: Add SME2 FP16 kernels. | David Mansell
2023-03-07 | GEMM: SME: Allow threading for quantized GEMMs. | David Mansell
2023-02-08 | Update CPU kernels to remove x19 and w19 | Michael Tyler
2023-01-18 | Revert "Update CPU kernels to remove x19" | Michael Tyler
2023-01-16 | Update CPU kernels to remove x19 | Michael Tyler
2022-12-28 | Fix company name on copyright notice | Viet-Hoa Do
2022-12-28 | Fix various compilation errors | Viet-Hoa Do
2022-11-28 | Integrate SME2 kernels | Viet-Hoa Do
2022-09-23 | CPU GEMM: Fix overreads in SVE merges. | David Mansell
2022-08-04 | [ONCPUML-970] Fast math mode for fixed format kernels | Pablo Marquez Tello
2022-07-19 | [ONCPUML-951] Variable weight support for Convolution. | Francesco Petrogalli
2022-07-18 | Fix Neoverse V1 heuristics for FP32 fast mode | ramelg01
2022-07-01 | Fix OpenBSD build errors | Pablo Marquez Tello
2022-05-24 | [arm_gemm] Import fixed-format kernels from gemm_linux. | Francesco.Petrogalli@arm.com
2022-05-06 | Updating a64_gemm_u8 a64_gemm_s8 kernels headers | ramelg01
2022-04-06 | [arm_gemm] Use static validate to find arm_gemm kernels. | Francesco.Petrogalli@arm.com
2022-04-04 | Remove Non-Inclusive Term "Master" | ramelg01
2022-03-16 | Remove deprecated interface from arm_compute. | Francesco.Petrogalli@arm.com
2022-03-10 | Added windows native build support | Pablo Tello
2022-01-25 | Add OpenBSD/arm64 support. | Kevin Lo
2021-12-24 | Replacing non-inclusive terms with proper terms | ramelg01
2021-12-14 | Update A510 arm_gemm cpu Kernels | ramelg01
2021-11-05 | Update GeMM heuristic on CPU | Gian Marco Iodice
2021-09-29 | Add support for non-constant weights and biases in CpuFullyConnected | Giorgio Arena
2021-09-16 | Revert "Add support for non-constant weights and biases in CpuFullyConnected" | Pablo Marquez Tello
2021-09-15 | Update cpu heuristics for GEMM | Gunes Bayir
2021-09-07 | Add support for non-constant weights and biases in CpuFullyConnected | Michele Di Giorgio
2021-09-07 | Update cpu depthwise kernels | Freddie Liardet
2021-08-02 | Add missing limits include | Freddie Liardet
2021-07-30 | Compilation issue: neon=1 armv8.2 on Android with NDKr18beta1 | Gian Marco Iodice
2021-07-29 | Fix A55 performance constant for fp16 hybrid gemm kernel | Georgios Pinitas
2021-07-28 | Fix bare metal build issues | Freddie Liardet
2021-07-28 | Remove generated kernels that overlap hand-written ones | Georgios Pinitas
2021-07-27 | Update GEMM assembly performance parameters | Georgios Pinitas
2021-07-26 | Add missing limits include | Freddie Liardet
2021-07-23 | Fix vector_length identification mechanism for SVE | Georgios Pinitas
2021-07-23 | Fix bare metal build error | Freddie Liardet
2021-07-22 | Update GEMM assembly kernels | Georgios Pinitas