path: root/src/gpu/cl/kernels
Age | Commit message | Author
2024-04-22 | Scatter GPU Kernel Implementation for 1D tensors. | Mohammed Suhail Munshi
2024-03-21 | Add skeleton for CLScatter op, reference and tests | Mohammed Suhail Munshi
2024-02-09 | Add support for QSYMM8 in ClCastKernel | Pablo Marquez Tello
2023-12-22 | Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel | Gunes Bayir
2023-12-22 | Add Mali™-G720 and Mali™-G620 as GpuTargets | Gunes Bayir
2023-12-15 | Fix nightly bug caused by not validation 3d cases for input tensor | Gunes Bayir
2023-12-15 | Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel" | Gunes Bayir
2023-12-13 | Fix nightly bug caused by wrong validation in Gemm mmul kernel | Gunes Bayir
2023-11-14 | Update comments to suppress doxygen warnings. | Anitha Raj
2023-10-31 | Optimize CL softmax | Viet-Hoa Do
2023-10-05 | Optimize CLTranspose operator | Jakub Sujak
2023-09-28 | Implement Quantized Matmul Nt/T kernel using MMUL extension | Gunes Bayir
2023-09-28 | Apply clang-format on repository | Felix Thomasmathibalan
2023-09-18 | Implement Quantized MatMul kernel using MMUL extension | Gunes Bayir
2023-09-14 | Add skeleton of ClMatMulLowpNativeMMULKernel | Gunes Bayir
2023-09-04 | Remove legacy PostOps code | Jakub Sujak
2023-07-28 | Retain back-compatibility for arm_compute/core/Types.h | SiCong Li
2023-07-18 | Break up core/Utils.h to reduce unused code being included everywhere | Matthew Bentham
2023-07-13 | Added S64/U64 support for the input in CLCast | Pablo Marquez Tello
2023-07-11 | Add Bias to MatMul Kernels and add support for use in Fully Connected Layer | Mohammed Suhail Munshi
2023-07-06 | Fix nightly failures in MatMulLowpNativeKernel when using bounded activation ... | Mohammed Suhail Munshi
2023-06-29 | Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension | Gunes Bayir
2023-06-26 | Add helpers to set CKW tensor components as OpenCL kernel arguments | Jakub Sujak
2023-06-26 | Use MatMul in fully connected layer with dynamic weights when supported | Mohammed Suhail Munshi
2023-06-23 | Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension | Ramy Elgammal
2023-06-19 | Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension | SiCong Li
2023-06-16 | Add Fused Activation to OpenCL MatMul | Mohammed Suhail Munshi
2023-06-15 | Break up arm_compute/core/Types.h a bit | Matthew Bentham
2023-06-06 | Fix ScaleKernel validate method. | Pablo Marquez Tello
2023-05-05 | Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int... | Jakub Sujak
2023-05-02 | Fix export_to_cl_image issue in the fp16 GeMM implementation | Gian Marco Iodice
2023-05-02 | Add fp16 GeMM heuristic for Arm® Mali™-G710 | Gian Marco Iodice
2023-04-26 | Change fp16 GeMM heuristic for Arm® Mali™-G77 | Gian Marco Iodice
2023-04-26 | Improve Winograd performance on OpenCL | Gian Marco Iodice
2023-04-17 | Add quantized CL MatMul kernels for Lhs NT/T, Rhs NT | Gunes Bayir
2023-04-14 | Align naming convention of ClMatMul | Jakub Sujak
2023-04-03 | Implement MatMul Function | Ramy Elgammal
2023-03-24 | Work around CLScale compiler-specific issue | SiCong Li
2023-03-24 | Add Texture Pipe Support for Matmul Lhs T/NT Rhs NT kernels | Gunes Bayir
2023-03-20 | Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16 | Gunes Bayir
2023-03-17 | Implementation of RSQRT for quantized int8 | Ramy Elgammal
2023-03-17 | Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16 | Ramy Elgammal
2023-03-06 | Fix LWS search space used by CLTuner | SiCong Li
2023-02-28 | Add an option to use lowest for max-pooling | Adnan AlSinan
2023-01-10 | Fix CL DirectConvolutionLayer validate tests | SiCong Li
2023-01-10 | Extend cl image support to input and output tensors | Gian Marco Iodice
2022-12-29 | Optimize CL Scale/Resize Quantized by removing (de)quant. code | Gunes Bayir
2022-12-29 | Extend Transposed Conv. for tiles with N0>1 | Gunes Bayir
2022-12-23 | Make CLReshape kernel window based on dst instead of src | Ramy Elgammal
2022-12-14 | Optimize Transposed Convolution for CL backend (Quantized) | Gunes Bayir