ComputeLibrary.git -

Age	Commit message (Expand)	Author
2022-09-14	INT8 Quantized MeanStdDevNorm (LayerNorm)	Murray Kornelsen
2022-09-12	Add test for NEGEMM to test a batched matrix multiplication with variable inp...	Adnan AlSinan
2022-09-09	Rework heuristic in ClConv2d	Gian Marco Iodice
2022-09-09	Optimize FP32/16 Bilinear Scale Kernel for Neon™	Gunes Bayir
2022-09-09	Add a macro guard in all OpenCL kernels in gemmlowp.cl	Gian Marco Iodice
2022-09-08	Disable Winograd on fp16 if fast-math = false	Ramy Elgammal
2022-09-07	Optimize depthwise convolution on OpenCL	Gian Marco Iodice
2022-09-02	F16 Specialization for MeanStdDevNorm	Murray Kornelsen
2022-09-02	Enable Winograd-based conv2d when IFM>=8 on Gpu	Gian Marco Iodice
2022-09-01	Use parent buffer in CLSubTensor. This avoids calling enqueueMapBuffer repeat...	Murray Kornelsen
2022-08-24	Fix add for tensors with non-matching strides	Jonathan Deakin
2022-08-24	Fix validation problem in CLQLSTMLayer	Pablo Marquez Tello
2022-08-23	Fix macos build errors	Pablo Marquez Tello
2022-08-18	Use Neon™ kernels for FP Bilinear Resize for SVE	Gunes Bayir
2022-08-17	Revert "Fix performance regression in ClConv2D"	Ramy Elgammal
2022-08-17	Add LUT for quantized sigmoid function	Viet-Hoa Do
2022-08-16	Fix performance regression in ClConv2D	Gian Marco Iodice
2022-08-11	Fix performance regression in Conv2D on OpenCL	Adnan AlSinan
2022-08-11	Disable unsafe FP optimizations in Winograd Output Transform	Gunes Bayir
2022-08-11	Fix CTS/SLTS failure related to Depthwise Convolution	Gunes Bayir
2022-08-08	Fix for AI benchmark ResNet regression	Viet-Hoa Do
2022-08-05	Fix LeNet-f16 convolution regression	Adnan AlSinan
2022-08-04	[ONCPUML-970] Fast math mode for fixed format kernels	Pablo Marquez Tello
2022-08-03	Add Dynamic Fusion Tests with BugFixes	Mohammed Suhail Munshi
2022-08-03	[ONCPUML-968] Fixed format kernel support in additional APIs	Milos Puzovic
2022-08-02	Update the GPUTarget list	Gian Marco Iodice
2022-08-01	Optimize add layer by considering the input tensors as 1D array	Gunes Bayir
2022-08-01	Fix for OpenMP scheduler work breakdown	Milos Puzovic
2022-07-27	Fix compilation error rasied in Nightly_NEW	Ramy Elgammal
2022-07-26	Fix for inclusion of "arm_gemm" from src into "Types.h" from core	Ramy Elgammal
2022-07-25	Enable march=armv8.6-a in non multi-isa builds	Pablo Marquez Tello
2022-07-22	Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED	Freddie Liardet
2022-07-22	Update ClConv2D heuristic to use direct convolution	Adnan AlSinan
2022-07-21	Fix direct convolution cases that were failing on Odroid	Adnan AlSinan
2022-07-19	[ONCPUML-951] Variable weight support for Convolution.	Francesco Petrogalli
2022-07-18	Fix Neoverse V1 heuristics for FP32 fast mode	ramelg01
2022-07-14	Integrate new winograd APIs from MLTech	ramelg01
2022-07-13	Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16	Gunes Bayir
2022-07-13	Fixed clang-cl errors on Windows native builds.	Pablo Tello
2022-07-08	Extended direct conv 2d interface for tuning the OpenCl kernel	Gian Marco Iodice
2022-07-07	Add missing flag when building cl graph examples and fix	Michalis Spyrou
2022-07-05	Add G57 to GPUTarget	SiCong Li
2022-07-04	Fix build errors on armv8.6 SVE2 with NDK 23 and 24	Michalis Spyrou
2022-07-01	Fix OpenBSD build errors	Pablo Marquez Tello
2022-06-30	Wrong arguments for running activation function in CpuGemmDirectConv2d	Michalis Spyrou
2022-06-29	Add LUT-based leaky relu for QASYMM8 on CPU	Viet-Hoa Do
2022-06-28	Fix OpenCL Winograd output transform	Gian Marco Iodice
2022-06-27	Implement new Elementwise Dynamic Fusion Operators: Div, Floor	Michalis Spyrou
2022-06-24	Improve LUT Neon Hard-Swish	Pablo Marquez Tello
2022-06-23	Select neon LUT Hard-Swish kernel on all devices	Pablo Marquez Tello