aboutsummaryrefslogtreecommitdiff
path: root/src/gpu/cl/operators
AgeCommit message (Expand)Author
2024-04-25Add update/index/output (m+1)/2d/(m+n) support for CLScatterGunes Bayir
2024-04-22Scatter GPU Kernel Implementation for 1D tensors.Mohammed Suhail Munshi
2024-03-25Adds Tests and reference implementation for scatter operator with 1D tensors.Mohammed Suhail Munshi
2024-03-21Add skeleton for CLScatter op, reference and testsMohammed Suhail Munshi
2023-12-22Add Mali™-G720 and Mali™-G620 as GpuTargetsGunes Bayir
2023-11-15Fix various coverity issuesSiCong Li
2023-10-31Optimize CL softmaxViet-Hoa Do
2023-10-13Connect MatMul MMUL kernels to ClMatMul operatorGunes Bayir
2023-09-28Apply clang-format on repositoryFelix Thomasmathibalan
2023-09-04Remove legacy PostOps codeJakub Sujak
2023-08-08Avoid using CLMatMul in CLFullyConnected when GPUTarget is Midgardramy.elgammal@arm.com
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-20Fix failing CTS tests by disabling matmul when weights conversion is required.Mohammed Suhail Munshi
2023-07-11Add Bias to MatMul Kernels and add support for use in Fully Connected LayerMohammed Suhail Munshi
2023-07-07Fix unsupported configuration in CLFullyConnected validationGunes Bayir
2023-06-26Use MatMul in fully connected layer with dynamic weights when supportedMohammed Suhail Munshi
2023-06-16Add Fused Activation to OpenCL MatMulMohammed Suhail Munshi
2023-06-15Break up Utils.h a bit to reduce unused code being included everywhereMatthew Bentham
2023-06-15Break up arm_compute/core/Types.h a bitMatthew Bentham
2023-05-05Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int...Jakub Sujak
2023-05-04Implement OpenCL MatMul heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-04-14Align naming convention of ClMatMulJakub Sujak
2023-04-04Support dynamic weights for Fully Connected layers on GPUJakub Sujak
2023-04-03Implement MatMul FunctionRamy Elgammal
2023-01-18Add broadcast batched matmul validation casesSiCong Li
2023-01-17Fix ClGemm crashes on unsupported data typesSiCong Li
2022-12-29Update the ClConv2d heuristicGian Marco Iodice
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-12Fix build error resulting from incorrect header pathJakub Sujak
2022-12-09Use heuristics for setting dynamic fusion direct conv2d tile sizesRamy Elgammal
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-22Remove dynamic fusion prototype with tests and examplesSiCong Li
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-09-09Rework heuristic in ClConv2dGian Marco Iodice
2022-09-02Enable Winograd-based conv2d when IFM>=8 on GpuGian Marco Iodice
2022-08-17Revert "Fix performance regression in ClConv2D"Ramy Elgammal
2022-08-16Fix performance regression in ClConv2DGian Marco Iodice
2022-08-11Fix performance regression in Conv2D on OpenCLAdnan AlSinan
2022-08-11Disable unsafe FP optimizations in Winograd Output TransformGunes Bayir
2022-08-05Fix LeNet-f16 convolution regressionAdnan AlSinan
2022-07-22Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNEDFreddie Liardet
2022-07-22Update ClConv2D heuristic to use direct convolutionAdnan AlSinan
2022-07-13Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16Gunes Bayir
2022-07-08Extended direct conv 2d interface for tuning the OpenCl kernelGian Marco Iodice
2022-05-11Fix inclusion guard for dynamic fusion moduleSiCong Li
2022-05-06Integrate Dynamic Fusion patchesSiCong Li
2022-03-15Implementation of ClPooling3dramelg01
2022-02-21Fix performance regression on Arm(R) Mali(TM)-G71Gian Marco Iodice
2022-02-10Fix performance regression on the first layer of convolution-based modelGian Marco Iodice
2022-02-02Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"Ramy Elgammal