Age | Commit message | Author
2022-09-13 | Add test case for disable Winograd on fp16 if fast-math = false | Ramy Elgammal
2022-09-12 | Add test for NEGEMM to test a batched matrix multiplication with variable inp... | Adnan AlSinan
2022-09-09 | Rework heuristic in ClConv2d | Gian Marco Iodice
2022-09-09 | Optimize FP32/16 Bilinear Scale Kernel for Neon™ | Gunes Bayir
2022-09-09 | Add a macro guard in all OpenCL kernels in gemmlowp.cl | Gian Marco Iodice
2022-09-08 | Disable Winograd on fp16 if fast-math = false | Ramy Elgammal
2022-09-07 | Optimize depthwise convolution on OpenCL | Gian Marco Iodice
2022-09-07 | Add test for CLGEMM to test a batched matrix multiplication with variable inp... | Mohammed Suhail Munshi
2022-09-02 | F16 Specialization for MeanStdDevNorm | Murray Kornelsen
2022-09-02 | Enable Winograd-based conv2d when IFM>=8 on Gpu | Gian Marco Iodice
2022-09-01 | Compute Hard-Swish with a Lookup table for qasymm8_signed. | Pablo Marquez Tello
2022-09-01 | Use parent buffer in CLSubTensor. This avoids calling enqueueMapBuffer repeat... | Murray Kornelsen
2022-08-24 | Fix add for tensors with non-matching strides | Jonathan Deakin
2022-08-24 | Fix validation problem in CLQLSTMLayer | Pablo Marquez Tello
2022-08-23 | Fix macos build errors | Pablo Marquez Tello
2022-08-18 | Use Neon™ kernels for FP Bilinear Resize for SVE | Gunes Bayir
2022-08-17 | Revert "Fix performance regression in ClConv2D" | Ramy Elgammal
2022-08-17 | Add LUT for quantized sigmoid function | Viet-Hoa Do
2022-08-16 | Fix performance regression in ClConv2D | Gian Marco Iodice
2022-08-12 | Fix note in guidelines doc | Ramy Elgammal
2022-08-12 | Update release notes about armv8.6 build flag change | Ramy Elgammal
2022-08-11 | Fix performance regression in Conv2D on OpenCL | Adnan AlSinan
2022-08-11 | Disable unsafe FP optimizations in Winograd Output Transform | Gunes Bayir
2022-08-11 | Update README | Ramy Elgammal
2022-08-11 | Fix CTS/SLTS failure related to Depthwise Convolution | Gunes Bayir
2022-08-08 | Fix for AI benchmark ResNet regression | Viet-Hoa Do
2022-08-08 | Update Errata | Ramy Elgammal
2022-08-05 | Update SONAME_VERSION in SConscript | Ramy Elgammal
2022-08-05 | Fix LeNet-f16 convolution regression | Adnan AlSinan
2022-08-04 | [ONCPUML-970] Fast math mode for fixed format kernels | Pablo Marquez Tello
2022-08-03 | Add Dynamic Fusion Tests with BugFixes | Mohammed Suhail Munshi
2022-08-03 | [ONCPUML-968] Fixed format kernel support in additional APIs | Milos Puzovic
2022-08-02 | Update the GPUTarget list | Gian Marco Iodice
2022-08-01 | Optimize add layer by considering the input tensors as 1D array | Gunes Bayir
2022-08-01 | Fix for OpenMP scheduler work breakdown | Milos Puzovic
2022-08-01 | Fix building failure with validate_examples set to true | Michalis Spyrou
2022-07-29 | Updated documentation | Pablo Marquez Tello
2022-07-27 | Fix compilation error rasied in Nightly_NEW | Ramy Elgammal
2022-07-26 | Fix build android build error | Pablo Tello
2022-07-26 | Fix for inclusion of "arm_gemm" from src into "Types.h" from core | Ramy Elgammal
2022-07-25 | Enable integrated assembler for Android™ 13 and onwards only | SiCong Li
2022-07-25 | Enable march=armv8.6-a in non multi-isa builds | Pablo Marquez Tello
2022-07-25 | Mention Arm® Neoverse® in supported systems. | Pablo Marquez Tello
2022-07-22 | Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED | Freddie Liardet
2022-07-22 | Update ClConv2D heuristic to use direct convolution | Adnan AlSinan
2022-07-21 | Fix direct convolution cases that were failing on Odroid | Adnan AlSinan
2022-07-21 | Added CONTRIBUTING.md | Pablo Marquez Tello
2022-07-20 | Remove data extraction scripts | Pablo Marquez Tello
2022-07-19 | [ONCPUML-951] Variable weight support for Convolution. | Francesco Petrogalli
2022-07-18 | Fix multi_isa build failure after Winograd integration | ramelg01