path: root/src/core/CL/cl_kernels
Age | Commit message | Author
2022-03-15 | Implementation of ClPooling3d | ramelg01
2022-03-08 | Merge kernel prototype patch | Giorgio Arena
2022-02-11 | Improve start-up time for concatenation layers | ramelg01
2022-02-10 | Improve start-up time for winograd_output_transform_*_nhwc | ramelg01
2022-02-09 | Remove deprecated remap functions. | Adnan AlSinan
2022-02-09 | Improve start-up time for winograd_input_transform_*_nhwc | ramelg01
2022-02-08 | Improve start-up time for winograd_filter_transform_*_nhwc | ramelg01
2022-02-02 | Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros" | Ramy Elgammal
2022-01-25 | Rework gemm_mm_reshaped_only_rhs_ kernels with new macros | Gian Marco Iodice
2021-12-23 | Rework gemm_reshape_lhs_ with new macros | Adnan AlSinan
2021-12-13 | Remove padding from ClDirectConv2dKernel | Adnan AlSinan
2021-12-10 | Use #if directive instead of regular condition in CLDirectConv2D | Giorgio Arena
2021-12-01 | Improve start-up direct convolution on OpenCL | Gian Marco Iodice
2021-11-29 | Use loop unrolling only when the kernel height is less than 5 | Gian Marco Iodice
2021-11-26 | Rework gemm_reshape_rhs_(nt,t) with new macros | Gian Marco Iodice
2021-11-20 | Improve start-up timer for GeMM (floating-point): | ramelg01
2021-11-17 | Improve start-up timer for ClIm2Col | Giorgio Arena
2021-11-17 | Improve start-up time for depthwise convolution | Sheri Zhang
2021-11-09 | Improve start-up time for ClScale | Adnan AlSinan
2021-11-04 | Add validate tests for CLConvolutionLayer and CLGEMMConvolutionLayer with pos... | SiCongLi
2021-11-04 | Add PRelu to supported PostOps in: | ramelg01
2021-11-03 | Fix out-of-bound reads in cl gemm kernels | SiCongLi
2021-11-02 | Add post ops to ClGemmMatrixMultiplyReshapedOnlyRHSKernel and ClGemmMatrixMul... | SiCongLi
2021-11-01 | Remove padding in FP Cl Gemm kernels | SiCongLi
2021-10-28 | Add experimental PostOp interface to ClGemmMatrixMultiplyReshapedKernel Part 1 | SiCongLi
2021-10-20 | Implement CLDirectConv3DKernel - uint8/int8 | Giorgio Arena
2021-10-18 | Remove legacy GeMM kernels on OpenCL | Gian Marco Iodice
2021-10-18 | Fix precision issue in ChannelShuffleKernel | Pablo Marquez Tello
2021-10-15 | Fix CLConv3D filelist and comments | Giorgio Arena
2021-10-14 | Implement CLDirectConv3D f32/f16 | Giorgio Arena
2021-10-13 | Improve performance of Softmax uint8 on GPU | Adnan AlSinan
2021-09-23 | Fix inefficient store in gemmlowp_mm_reshaped_only_rhs_t | Gian Marco Iodice
2021-09-14 | Optimize ClScaleKernel on NHWC (f32/f16/int8) | Gian Marco Iodice
2021-09-09 | Remove padding from ClGemmMatrixMultiplyReshapedOnlyRhsKernel | Giorgio Arena
2021-09-08 | Fix vload_partial macros on OpenCL | Giorgio Arena
2021-09-07 | Remove padding from ClGemmMatrixMultiplyReshapedKernel | Giorgio Arena
2021-09-06 | Revert "Remove padding from ClGemmMatrixMultiplyReshapedKernel" | Pablo Marquez Tello
2021-09-03 | Remove padding from ClPool2dKernel NCHW | Giorgio Arena
2021-09-03 | Fix CLNormalizationLayer NCHW border calculation | SiCongLi
2021-09-01 | Remove padding from ClGemmMatrixMultiplyReshapedKernel | Michele Di Giorgio
2021-08-23 | Remove padding from ClScaleKernel | Giorgio Arena
2021-07-25 | Reorganize the kernels into nhwc, nchw and common folders | Adnan AlSinan
2021-07-22 | Fix oclgrind int overflow warning | Freddie Liardet
2021-07-14 | Fix CL kernel compilation failure | Michalis Spyrou
2021-07-13 | Add in-place calculation support for CL elementwise arithmetic kernels | Sheri Zhang
2021-07-09 | Limit the LOOP_UNROLLING on the kernel height | Gian Marco Iodice
2021-07-08 | Remove redundant implementations of Add/Sub operators | Georgios Pinitas
2021-07-02 | Rework OpenCL Depthwise Convolution | Gian Marco Iodice
2021-07-01 | Add quantization helper functions for OpenCL | Georgios Pinitas
2021-06-30 | Revert "Rework OpenCL Depthwise Convolution" | Gian Marco Iodice