aboutsummaryrefslogtreecommitdiff
path: root/src/core/CL
AgeCommit message (Expand)Author
2021-09-23Fix inefficient store in gemmlowp_mm_reshaped_only_rhs_tGian Marco Iodice
2021-09-22Update OpenCL header file to version 2020.12.18Sheri Zhang
2021-09-14Optimize ClScaleKernel on NHWC (f32/f16/int8)Gian Marco Iodice
2021-09-09Remove padding from ClGemmMatrixMultiplyReshapedOnlyRhsKernelGiorgio Arena
2021-09-08Fix vload_partial macros on OpenCLGiorgio Arena
2021-09-07Remove padding from ClGemmMatrixMultiplyReshapedKernelGiorgio Arena
2021-09-06Revert "Remove padding from ClGemmMatrixMultiplyReshapedKernel"Pablo Marquez Tello
2021-09-03Remove padding from ClPool2dKernel NCHWGiorgio Arena
2021-09-03Fix CLNormalizationLayer NCHW border calculationSiCongLi
2021-09-01Remove padding from ClGemmMatrixMultiplyReshapedKernelMichele Di Giorgio
2021-08-25Move CPU/GPU files from Core/Runtime to the respective backend foldersGeorgios Pinitas
2021-08-23Remove padding from ClScaleKernelGiorgio Arena
2021-08-02Benchmark and set default LWS for GEMM, Direct convolution and WinogradGiorgio Arena
2021-07-27Port CLGEMMConvolutionLayerManuel Bottini
2021-07-25Reorganize the kernels into nhwc, nchw and common foldersAdnan AlSinan
2021-07-22Fix oclgrind int overflow warningFreddie Liardet
2021-07-16Port CLIm2ColKernel to ClIm2ColKernelManuel Bottini
2021-07-15Port CLCol2ImKernel to ClCol2ImKernelManuel Bottini
2021-07-14Fix CL kernel compilation failureMichalis Spyrou
2021-07-13Add in-place calculation support for CL elementwise arithmetic kernelsSheri Zhang
2021-07-09Limit the LOOP_UNROLLING on the kernel heightGian Marco Iodice
2021-07-08Remove redundant implementations of Add/Sub operatorsGeorgios Pinitas
2021-07-06Fix manual LOOP_UNROLLINGGian Marco Iodice
2021-07-02Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-07-02Implement FP GPU depthwise convolution 1x1 kernel for in-place computationSiCongLi
2021-07-01Add quantization helper functions for OpenCLGeorgios Pinitas
2021-07-01Adjust minimum DDK version for manual unrollGiorgio Arena
2021-06-30Revert "Rework OpenCL Depthwise Convolution"Gian Marco Iodice
2021-06-29Port the ClGemmLowp kernels to the new APIGeorgios Pinitas
2021-06-29Enable global pooling optimization on OpenCLGian Marco Iodice
2021-06-29Set up the framework to choose the default LWSGiorgio Arena
2021-06-24Rework gemmlowp reshaped_only_rhs using the new macrosGiorgio Arena
2021-06-24Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-06-22Add FP16 support to CLRemapFreddie Liardet
2021-06-15Port CLWinogradConvolutionLayer with ClWinogradConv2dManuel Bottini
2021-06-15Add NHWC support to CLRemapFrederick Liardet
2021-06-07Revert "Add optimization for global pooling in pooling_layer.cl"Pablo Tello
2021-06-02Fix bug in PReluLayer when input is 1xN sizeFreddie Liardet
2021-06-01Fuse activation in ClDirectConv2dKernel for float typesGeorgios Pinitas
2021-06-01Rename ported functionsManuel Bottini
2021-05-25Fix retrieving device architecture twice in CLCompileContextGiorgio Arena
2021-05-20Enable unroll through pragma based on DDK versionGiorgio Arena
2021-05-19clCreateKernel failure of CL/ChannelShuffle/U8/Manuel Bottini
2021-05-19Port DepthConvert to new ApiGeorgios Pinitas
2021-05-18Port CLGEMM to memory injecting interfaceGeorgios Pinitas
2021-05-17Add macro to manually unroll loops in OpenCLGiorgio Arena
2021-05-17Fix oclgrind error on CLGEMMLowp reshaped only RHS quantized per channelGiorgio Arena
2021-05-17Fix MeanStdDevNormalizationLayer reference outputting nan for FP16Giorgio Arena
2021-05-13Fix integer overflow and null dereferenceMichalis Spyrou
2021-05-13Remove padding from CLChannelShuffleLayerKernelManuel Bottini