aboutsummaryrefslogtreecommitdiff
path: root/src/gpu
AgeCommit message (Expand)Author
5 daysAdd fp16 and integer data type support for ScatterNd in GpuGunes Bayir
10 daysAdd update/index/output (m+1)/2d/(m+n) support for CLScatterGunes Bayir
13 daysScatter GPU Kernel Implementation for 1D tensors.Mohammed Suhail Munshi
2024-03-25Adds Tests and reference implementation for scatter operator with 1D tensors.Mohammed Suhail Munshi
2024-03-21Add skeleton for CLScatter op, reference and testsMohammed Suhail Munshi
2024-03-04Fix performance regression in fixed-format kernelsGunes Bayir
2024-02-09Add support for QSYMM8 in ClCastKernelPablo Marquez Tello
2023-12-22Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernelGunes Bayir
2023-12-22Add Mali™-G720 and Mali™-G620 as GpuTargetsGunes Bayir
2023-12-15Fix nightly bug caused by not validation 3d cases for input tensorGunes Bayir
2023-12-15Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel"Gunes Bayir
2023-12-13Fix nightly bug caused by wrong validation in Gemm mmul kernelGunes Bayir
2023-11-15Fix various coverity issuesSiCong Li
2023-11-14Update comments to suppress doxygen warnings.Anitha Raj
2023-10-31Optimize CL softmaxViet-Hoa Do
2023-10-13Connect MatMul MMUL kernels to ClMatMul operatorGunes Bayir
2023-10-05Optimize CLTranspose operatorJakub Sujak
2023-09-28Implement Quantized Matmul Nt/T kernel using MMUL extensionGunes Bayir
2023-09-28Apply clang-format on repositoryFelix Thomasmathibalan
2023-09-18Implement Quantized MatMul kernel using MMUL extensionGunes Bayir
2023-09-14Add skeleton of ClMatMulLowpNativeMMULKernelGunes Bayir
2023-09-04Remove legacy PostOps codeJakub Sujak
2023-08-08Avoid using CLMatMul in CLFullyConnected when GPUTarget is Midgardramy.elgammal@arm.com
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-20Fix failing CTS tests by disabling matmul when weights conversion is required.Mohammed Suhail Munshi
2023-07-18Break up core/Utils.h to reduce unused code being included everywhereMatthew Bentham
2023-07-13Added S64/U64 support for the input in CLCastPablo Marquez Tello
2023-07-11Add Bias to MatMul Kernels and add support for use in Fully Connected LayerMohammed Suhail Munshi
2023-07-07Fix unsupported configuration in CLFullyConnected validationGunes Bayir
2023-07-06Fix nightly failures in MatMulLowpNativeKernel when using bounded activation ...Mohammed Suhail Munshi
2023-06-29Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extensionGunes Bayir
2023-06-26Add helpers to set CKW tensor components as OpenCL kernel argumentsJakub Sujak
2023-06-26Use MatMul in fully connected layer with dynamic weights when supportedMohammed Suhail Munshi
2023-06-23Implement FP32/FP16 MatMul NT/T kernel using the MMUL extensionRamy Elgammal
2023-06-19Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extensionSiCong Li
2023-06-16Add Fused Activation to OpenCL MatMulMohammed Suhail Munshi
2023-06-15Break up Utils.h a bit to reduce unused code being included everywhereMatthew Bentham
2023-06-15Break up arm_compute/core/Types.h a bitMatthew Bentham
2023-06-06Fix ScaleKernel validate method.Pablo Marquez Tello
2023-05-05Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int...Jakub Sujak
2023-05-04Implement OpenCL MatMul heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-05-02Fix export_to_cl_image issue in the fp16 GeMM implementationGian Marco Iodice
2023-05-02Add fp16 GeMM heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-04-27Add quantized CL MatMul kernel for LHS NT, RHS TJakub Sujak
2023-04-26Change fp16 GeMM heuristic for Arm® Mali™-G77Gian Marco Iodice
2023-04-26Improve Winograd performance on OpenCLGian Marco Iodice
2023-04-20Implement CL kernel for a native batched matmul Quantized - LHS transposed, R...Omar Al Khatib
2023-04-17Add quantized CL MatMul kernels for Lhs NT/T, Rhs NTGunes Bayir
2023-04-14Align naming convention of ClMatMulJakub Sujak
2023-04-04Support dynamic weights for Fully Connected layers on GPUJakub Sujak