ComputeLibrary.git

Age	Commit message	Author
2023-01-09	Add extend padding lock flag	Ramy Elgammal
2023-01-06	LHS broadcasting addition for dynamic fusion	Viet-Hoa Do
2022-12-29	Optimize CL Scale/Resize Quantized by removing (de)quant. code	Gunes Bayir
2022-12-29	Extend Transposed Conv. for tiles with N0>1	Gunes Bayir
2022-12-28	Fix company name on copyright notice	Viet-Hoa Do
2022-12-28	Fix various compilation errors	Viet-Hoa Do
2022-12-23	Make CLReshape kernel window based on dst instead of src	Ramy Elgammal
2022-12-21	Optimize MeanReduce by integer acc. and removing upfront dequant.	Omar Al Khatib
2022-12-21	Update direct conv2d kernel in dynamic fusion	Gian Marco Iodice
2022-12-21	Optimize SVE natural exponential function	Viet-Hoa Do
2022-12-21	Fixed various mismatches in CpuCastKernel	Pablo Marquez Tello
2022-12-14	Optimize Transposed Convolution for CL backend (Quantized)	Gunes Bayir
2022-12-09	Optimize CPU base-e exponential function on FP32	Viet-Hoa Do
2022-12-09	Implement the OpenCL kernel to compute the indirect convolution	Gian Marco Iodice
2022-11-29	Adding GpuAdd to dynamic fusion operators	Ramy Elgammal
2022-11-28	Integrate SME2 kernels	Viet-Hoa Do
2022-11-25	Implement address precalculation for indirect conv2d - OpenCL	Gian Marco Iodice
2022-11-22	Remove dynamic fusion prototype with tests and examples	SiCong Li
2022-11-15	Fixed Arm NN unit test failure caused by quantised multiplication patch.	Omar Al Khatib
2022-11-14	Optimize Transposed Convolution for CL backend (FP32/16)	Gunes Bayir
2022-11-14	Optimize T_QUANTIZE8_ASYMMETRIC for Mali™ G52	Pablo Marquez Tello
2022-11-10	Fix compiler warnings in dynamic fusion	SiCong Li
2022-11-07	Optimize CPU mul layer on quantized data	Omar Al Khatib
2022-11-03	Fix activation block in gemm.cl	Gian Marco Iodice
2022-11-02	Partially Revert "Add threshold for floating-point SOFT_RELU activation"	Gunes Bayir
2022-11-01	Add threshold for floating-point SOFT_RELU activation	Milos Puzovic
2022-11-01	Rewrite dynamic fusion	SiCong Li
2022-11-01	Rework direct convolution heuristic on OpenCL	Gian Marco Iodice
2022-10-27	Fix fixed-point quantized addition	Viet-Hoa Do
2022-10-24	Add FP16 tanh based on rational approximation	Jonathan Deakin
2022-10-12	Optimize Neon™ Logistic Activation	Mohammed Suhail Munshi
2022-10-07	Workaround CL compiler issue on FP16	Viet-Hoa Do
2022-10-06	Rework DepthwiseConvolution heuristic on OpenCL	Gian Marco Iodice
2022-10-06	Improve start-up time in gemmlowp reshaped rhs only.	Adnan AlSinan
2022-10-03	Force CL kernel compilation with 64 registers	Viet-Hoa Do
2022-10-03	Optimize CPU add layer on quantized data	Viet-Hoa Do
2022-09-28	Fix overflow in NEActivationLayer for FP16 type	Pablo Marquez Tello
2022-09-26	Add FP32 Neon™ swish activation	Jonathan Deakin
2022-09-23	CPU GEMM: Fix overreads in SVE merges.	David Mansell
2022-09-16	Optimize Quantized/Integer Bilinear Scale for Neon™	Gunes Bayir
2022-09-14	Interpreting tensor as 1D for CPU multiplication	Viet-Hoa Do
2022-09-14	Fix invalid memory access for dynamically fused Cl Elementwise kernels	SiCong Li
2022-09-14	Adding GELU activation	Murray Kornelsen
2022-09-14	INT8 Quantized MeanStdDevNorm (LayerNorm)	Murray Kornelsen
2022-09-09	Optimize FP32/16 Bilinear Scale Kernel for Neon™	Gunes Bayir
2022-09-09	Add a macro guard in all OpenCL kernels in gemmlowp.cl	Gian Marco Iodice
2022-09-07	Optimize depthwise convolution on OpenCL	Gian Marco Iodice
2022-09-02	F16 Specialization for MeanStdDevNorm	Murray Kornelsen
2022-09-02	Enable Winograd-based conv2d when IFM>=8 on Gpu	Gian Marco Iodice
2022-08-23	Fix macos build errors	Pablo Marquez Tello