ComputeLibrary.git -

Age	Commit message (Expand)	Author
2022-12-21	Optimize MeanReduce by integer acc. and removing upfront dequant.	Omar Al Khatib
2022-12-21	Update direct conv2d kernel in dynamic fusion	Gian Marco Iodice
2022-12-21	Optimize SVE natural exponential function	Viet-Hoa Do
2022-12-21	Fixed various mismatches in CpuCastKernel	Pablo Marquez Tello
2022-12-16	Add output operator for dynamic fusion	Viet-Hoa Do
2022-12-14	Optimize Transposed Convolution for CL backend (Quantized)	Gunes Bayir
2022-12-13	Add CLAMP operator to Dynamic Fusion interface	Jakub Sujak
2022-12-12	Fix build error resulting from incorrect header path	Jakub Sujak
2022-12-09	Implement Cast operator in dynamic fusion	Gunes Bayir
2022-12-09	Use heuristics for setting dynamic fusion direct conv2d tile sizes	Ramy Elgammal
2022-12-09	Optimize CPU base-e exponential function on FP32	Viet-Hoa Do
2022-12-09	Implement the OpenCL kernel to compute the indirect convolution	Gian Marco Iodice
2022-11-30	Fix build error for unused variables in data type specific builds	Gunes Bayir
2022-11-29	Adding GpuAdd to dynamic fusion operators	Ramy Elgammal
2022-11-28	Integrate SME2 kernels	Viet-Hoa Do
2022-11-28	Implement FP32/16 Depthwise Conv2d operator in dynamic fusion	Gunes Bayir
2022-11-25	Implement address precalculation for indirect conv2d - OpenCL	Gian Marco Iodice
2022-11-23	ONCPUML-1072: Remove double definition of get_mws for Mul kernel	fadara01
2022-11-22	Remove dynamic fusion prototype with tests and examples	SiCong Li
2022-11-22	ONCPUML-1072: Tuned MWS values (for N1, V1) for binary operators used by oneDNN	Fadi Arafeh
2022-11-18	Add num_threads_to_use to OMPScheduler based on workload size	cfRod
2022-11-15	Fix regression caused by mws in ActivationLayer	Mohammed Suhail Munshi
2022-11-15	Fixed Arm NN unit test failure caused by quantised multiplication patch.	Omar Al Khatib
2022-11-14	Optimize Transposed Convolution for CL backend (FP32/16)	Gunes Bayir
2022-11-14	Optimize T_QUANTIZE8_ASYMMETRIC for Mali™ G52	Pablo Marquez Tello
2022-11-10	Fix compiler warnings in dynamic fusion	SiCong Li
2022-11-09	Fix CPU multiplication layer threading overhead	Viet-Hoa Do
2022-11-08	SVE Hard-Swish via Lookup table for quantized input	Pablo Marquez Tello
2022-11-07	Optimize CPU mul layer on quantized data	Omar Al Khatib
2022-11-04	Fix compiler warnings in dynamic fusion	SiCong Li
2022-11-03	Fix activation block in gemm.cl	Gian Marco Iodice
2022-11-02	Partially Revert "Add threshold for floating-point SOFT_RELU activation"	Gunes Bayir
2022-11-01	Fix fixed-point quantized addition	Viet-Hoa Do
2022-11-01	Updateable weights in depthwise convolution	Milos Puzovic
2022-11-01	Add threshold for floating-point SOFT_RELU activation	Milos Puzovic
2022-11-01	Add check for Batch Matmul in GemmAssemblyDispatch	Mohammed Suhail Munshi
2022-11-01	Rewrite dynamic fusion	SiCong Li
2022-11-01	Rework direct convolution heuristic on OpenCL	Gian Marco Iodice
2022-10-27	Fix fixed-point quantized addition	Viet-Hoa Do
2022-10-24	Add FP16 tanh based on rational approximation	Jonathan Deakin
2022-10-20	Update reinterpret tensor as 1D for CPU add	Viet-Hoa Do
2022-10-20	Add test in GEMMLowp for batch matmul	Mohammed Suhail Munshi
2022-10-19	Fix FFTConvolutionLayer test	Viet-Hoa Do
2022-10-12	Optimize Neon™ Logistic Activation	Mohammed Suhail Munshi
2022-10-12	Adding documentation section explaining how BF16 is used	Ramy Elgammal
2022-10-10	Fix LUT-based activation layer	Viet-Hoa Do
2022-10-07	Workaround CL compiler issue on FP16	Viet-Hoa Do
2022-10-07	Optimize Neon™ SUB operator by squashing execution window	Jakub Sujak
2022-10-06	Rework DepthwiseConvolution heuristic on OpenCL	Gian Marco Iodice
2022-10-06	Improve start-up time in gemmlowp reshaped rhs only.	Adnan AlSinan