ComputeLibrary.git -

Age	Commit message (Expand)	Author
2024-04-25	Add update/index/output (m+1)/2d/(m+n) support for CLScatter	Gunes Bayir
2024-04-22	Scatter GPU Kernel Implementation for 1D tensors.	Mohammed Suhail Munshi
2024-03-25	Adds Tests and reference implementation for scatter operator with 1D tensors.	Mohammed Suhail Munshi
2024-03-21	Add skeleton for CLScatter op, reference and tests	Mohammed Suhail Munshi
2023-12-22	Add Mali™-G720 and Mali™-G620 as GpuTargets	Gunes Bayir
2023-11-15	Fix various coverity issues	SiCong Li
2023-10-31	Optimize CL softmax	Viet-Hoa Do
2023-10-13	Connect MatMul MMUL kernels to ClMatMul operator	Gunes Bayir
2023-09-28	Apply clang-format on repository	Felix Thomasmathibalan
2023-09-04	Remove legacy PostOps code	Jakub Sujak
2023-08-08	Avoid using CLMatMul in CLFullyConnected when GPUTarget is Midgard	ramy.elgammal@arm.com
2023-07-28	Retain back-compatibility for arm_compute/core/Types.h	SiCong Li
2023-07-20	Fix failing CTS tests by disabling matmul when weights conversion is required.	Mohammed Suhail Munshi
2023-07-11	Add Bias to MatMul Kernels and add support for use in Fully Connected Layer	Mohammed Suhail Munshi
2023-07-07	Fix unsupported configuration in CLFullyConnected validation	Gunes Bayir
2023-06-26	Use MatMul in fully connected layer with dynamic weights when supported	Mohammed Suhail Munshi
2023-06-16	Add Fused Activation to OpenCL MatMul	Mohammed Suhail Munshi
2023-06-15	Break up Utils.h a bit to reduce unused code being included everywhere	Matthew Bentham
2023-06-15	Break up arm_compute/core/Types.h a bit	Matthew Bentham
2023-05-05	Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int...	Jakub Sujak
2023-05-04	Implement OpenCL MatMul heuristic for Arm® Mali™-G710	Gian Marco Iodice
2023-04-14	Align naming convention of ClMatMul	Jakub Sujak
2023-04-04	Support dynamic weights for Fully Connected layers on GPU	Jakub Sujak
2023-04-03	Implement MatMul Function	Ramy Elgammal
2023-01-18	Add broadcast batched matmul validation cases	SiCong Li
2023-01-17	Fix ClGemm crashes on unsupported data types	SiCong Li
2022-12-29	Update the ClConv2d heuristic	Gian Marco Iodice
2022-12-14	Optimize Transposed Convolution for CL backend (Quantized)	Gunes Bayir
2022-12-12	Fix build error resulting from incorrect header path	Jakub Sujak
2022-12-09	Use heuristics for setting dynamic fusion direct conv2d tile sizes	Ramy Elgammal
2022-12-09	Implement the OpenCL kernel to compute the indirect convolution	Gian Marco Iodice
2022-11-22	Remove dynamic fusion prototype with tests and examples	SiCong Li
2022-11-14	Optimize Transposed Convolution for CL backend (FP32/16)	Gunes Bayir
2022-09-09	Rework heuristic in ClConv2d	Gian Marco Iodice
2022-09-02	Enable Winograd-based conv2d when IFM>=8 on Gpu	Gian Marco Iodice
2022-08-17	Revert "Fix performance regression in ClConv2D"	Ramy Elgammal
2022-08-16	Fix performance regression in ClConv2D	Gian Marco Iodice
2022-08-11	Fix performance regression in Conv2D on OpenCL	Adnan AlSinan
2022-08-11	Disable unsafe FP optimizations in Winograd Output Transform	Gunes Bayir
2022-08-05	Fix LeNet-f16 convolution regression	Adnan AlSinan
2022-07-22	Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED	Freddie Liardet
2022-07-22	Update ClConv2D heuristic to use direct convolution	Adnan AlSinan
2022-07-13	Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16	Gunes Bayir
2022-07-08	Extended direct conv 2d interface for tuning the OpenCl kernel	Gian Marco Iodice
2022-05-11	Fix inclusion guard for dynamic fusion module	SiCong Li
2022-05-06	Integrate Dynamic Fusion patches	SiCong Li
2022-03-15	Implementation of ClPooling3d	ramelg01
2022-02-21	Fix performance regression on Arm(R) Mali(TM)-G71	Gian Marco Iodice
2022-02-10	Fix performance regression on the first layer of convolution-based model	Gian Marco Iodice
2022-02-02	Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"	Ramy Elgammal