aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2021-07-22Update GEMM assembly kernelsGeorgios Pinitas
2021-07-19Add layer data to JSON outputFreddie Liardet
2021-07-16Include limits header to fix errors reported by GCC11Michele Di Giorgio
2021-07-16Avoid multiple Rhs matrix transformation on ClGemmGeorgios Pinitas
2021-07-16Port CLIm2ColKernel to ClIm2ColKernelManuel Bottini
2021-07-15Disabled DirectConv method for NCHW and kernel > 15Pablo Marquez Tello
2021-07-15Port NEGEMMConvolutionLayerManuel Bottini
2021-07-15Improve filelist for GPUGeorgios Pinitas
2021-07-15Port CLCol2ImKernel to ClCol2ImKernelManuel Bottini
2021-07-14Fix CL kernel compilation failureMichalis Spyrou
2021-07-13Add in-place calculation support for CL elementwise arithmetic kernelsSheri Zhang
2021-07-13Port NEWinogradConvolutionLayerMichalis Spyrou
2021-07-09Limit the LOOP_UNROLLING on the kernel heightGian Marco Iodice
2021-07-09Port NECol2ImKernelManuel Bottini
2021-07-08Remove redundant implementations of Add/Sub operatorsGeorgios Pinitas
2021-07-08Change CLConvolution selection to run Direct approach on large kernelsGeorgios Pinitas
2021-07-08Add LayerData to all nodesFreddie Liardet
2021-07-08Port NEGEMMLowp Part 2Manuel Bottini
2021-07-07Add basic Operator interfaceGeorgios Pinitas
2021-07-07Validate unsupported data types with runtime informationGeorgios Pinitas
2021-07-06Fix manual LOOP_UNROLLINGGian Marco Iodice
2021-07-06Port NEIm2ColKernelManuel Bottini
2021-07-05Improve implementation selection speed of CpuElementwiseUnaryGeorgios Pinitas
2021-07-02Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-07-02Align kernel/operator header layoutGeorgios Pinitas
2021-07-02Port NEGEMM to memory injecting interface (Part 3)Michele Di Giorgio
2021-07-02Implement FP GPU depthwise convolution 1x1 kernel for in-place computationSiCongLi
2021-07-01Add quantization helper functions for OpenCLGeorgios Pinitas
2021-07-01Reduce binary size footprint of CpuGemmInterleave4x4KernelMichele Di Giorgio
2021-07-01Adjust minimum DDK version for manual unrollGiorgio Arena
2021-06-30Port ClGemmLowpOutputStage operator to new interfaceGeorgios Pinitas
2021-06-30Revert "Rework OpenCL Depthwise Convolution"Gian Marco Iodice
2021-06-29Port the ClGemmLowp kernels to the new APIGeorgios Pinitas
2021-06-29Port NEGEMM to memory injecting interface (Part 2)Michele Di Giorgio
2021-06-29Enable global pooling optimization on OpenCLGian Marco Iodice
2021-06-29Port NEGEMM to memory injecting interface (Part 1)Michele Di Giorgio
2021-06-29Improve selection speed of CPU implementationsGeorgios Pinitas
2021-06-29Set up the framework to choose the default LWSGiorgio Arena
2021-06-28Simplify CpuInfo logicGeorgios Pinitas
2021-06-28Add code to detect Mali(TM) G31Pablo Marquez Tello
2021-06-25Port NEGEMMConv2d to memory injecting interfaceMichele Di Giorgio
2021-06-25Rename pooling implementation folder to pool2dGeorgios Pinitas
2021-06-24Rework gemmlowp reshaped_only_rhs using the new macrosGiorgio Arena
2021-06-24Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-06-23Create core library using high priority operatorsMichalis Spyrou
2021-06-23Add in-place computation for elementwise operationsSheri Zhang
2021-06-22Port NEGEMMLowp Part 1Manuel Bottini
2021-06-22Fix Winograd heuristic in CLConvolutionLayerGian Marco Iodice
2021-06-22Add FP16 support to CLRemapFreddie Liardet
2021-06-21Rework the CLConvolutionLayer heuristicGian Marco Iodice