aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2022-08-17Revert "Fix performance regression in ClConv2D"v22.08branches/arm_compute_22_08Ramy Elgammal
2022-08-16Fix performance regression in ClConv2DGian Marco Iodice
2022-08-12Fix performance regression in Conv2D on OpenCLAdnan AlSinan
2022-08-11Disable unsafe FP optimizations in Winograd Output TransformGunes Bayir
2022-08-11Fix CTS/SLTS failure related to Depthwise ConvolutionGunes Bayir
2022-08-08Fix for AI benchmark ResNet regressionViet-Hoa Do
2022-08-05Fix LeNet-f16 convolution regressionAdnan AlSinan
2022-08-04[ONCPUML-970] Fast math mode for fixed format kernelsPablo Marquez Tello
2022-08-03Add Dynamic Fusion Tests with BugFixesMohammed Suhail Munshi
2022-08-03[ONCPUML-968] Fixed format kernel support in additional APIsMilos Puzovic
2022-08-02Update the GPUTarget listGian Marco Iodice
2022-08-01Optimize add layer by considering the input tensors as 1D arrayGunes Bayir
2022-08-01Fix for OpenMP scheduler work breakdownMilos Puzovic
2022-07-27Fix compilation error rasied in Nightly_NEWRamy Elgammal
2022-07-26Fix for inclusion of "arm_gemm" from src into "Types.h" from coreRamy Elgammal
2022-07-25Enable march=armv8.6-a in non multi-isa buildsPablo Marquez Tello
2022-07-22Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNEDFreddie Liardet
2022-07-22Update ClConv2D heuristic to use direct convolutionAdnan AlSinan
2022-07-21Fix direct convolution cases that were failing on OdroidAdnan AlSinan
2022-07-19[ONCPUML-951] Variable weight support for Convolution.Francesco Petrogalli
2022-07-18Fix Neoverse V1 heuristics for FP32 fast moderamelg01
2022-07-14Integrate new winograd APIs from MLTechramelg01
2022-07-13Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16Gunes Bayir
2022-07-13Fixed clang-cl errors on Windows native builds.Pablo Tello
2022-07-08Extended direct conv 2d interface for tuning the OpenCl kernelGian Marco Iodice
2022-07-07Add missing flag when building cl graph examples and fixMichalis Spyrou
2022-07-05Add G57 to GPUTargetSiCong Li
2022-07-04Fix build errors on armv8.6 SVE2 with NDK 23 and 24Michalis Spyrou
2022-07-01Fix OpenBSD build errorsPablo Marquez Tello
2022-06-30Wrong arguments for running activation function in CpuGemmDirectConv2dMichalis Spyrou
2022-06-29Add LUT-based leaky relu for QASYMM8 on CPUViet-Hoa Do
2022-06-28Fix OpenCL Winograd output transformGian Marco Iodice
2022-06-27Implement new Elementwise Dynamic Fusion Operators: Div, FloorMichalis Spyrou
2022-06-24Improve LUT Neon Hard-SwishPablo Marquez Tello
2022-06-23Select neon LUT Hard-Swish kernel on all devicesPablo Marquez Tello
2022-06-23Enable loading OpenCL symbols in Android App native code, NDK environment.ohadagoogle
2022-06-16Fix SVE2 implementation of quantized SoftMax 1DViet-Hoa Do
2022-06-15Fix build error v8.2-a-svePablo Marquez Tello
2022-06-15Add support OpenCL 3.0 non-uniform workgroupViet-Hoa Do
2022-06-15Fix performance regression in Winograd Output Transform (OpenCL)Gian Marco Iodice
2022-06-13Add support for 2d and 3d indices for axis 1Pablo Marquez Tello
2022-06-09Fix crash in CpuActivationKernelPablo Marquez Tello
2022-06-07Compute Hard-Swish with a Lookup table for qasymm8.Pablo Marquez Tello
2022-05-31Add cl_khr_integer_dot_product extension supportViet-Hoa Do
2022-05-31Fix compilation error in 32-bit with logging buildViet-Hoa Do
2022-05-26Disable unsafe FP optimizations causing accuracy issuesGunes Bayir
2022-05-24[arm_gemm] Import fixed-format kernels from gemm_linux.Francesco.Petrogalli@arm.com
2022-05-17DepthwiseConv reports full assembly kernel namePablo Marquez Tello
2022-05-12Revert "Add support for 2d and 3d indices for axis 0"Mohammed Suhail Munshi
2022-05-11Fix inclusion guard for dynamic fusion moduleSiCong Li