path: root/src/core/CL/cl_kernels/nhwc
| Age | Commit message | Author |
|---|---|---|
| 2024-01-18 | Fix divide-by-zero compilation error | Viet-Hoa Do |
| 2023-04-26 | Improve Winograd performance on OpenCL | Gian Marco Iodice |
| 2023-04-03 | Add Cropping to CLBatchToSpace | Omar Al Khatib |
| 2023-01-18 | Revert "Update the heuristic for CLDepthwiseConvolutionNative kernel" | Gian Marco Iodice |
| 2023-01-12 | Update the heuristic for CLDepthwiseConvolutionNative kernel | Gian Marco Iodice |
| 2023-01-10 | Extend cl image support to input and output tensors | Gian Marco Iodice |
| 2022-12-29 | Optimize CL Scale/Resize Quantized by removing (de)quant. code | Gunes Bayir |
| 2022-12-29 | Extend Transposed Conv. for tiles with N0>1 | Gunes Bayir |
| 2022-12-21 | Update direct conv2d kernel in dynamic fusion | Gian Marco Iodice |
| 2022-12-14 | Optimize Transposed Convolution for CL backend (Quantized) | Gunes Bayir |
| 2022-12-09 | Implement the OpenCL kernel to compute the indirect convolution | Gian Marco Iodice |
| 2022-11-25 | Implement address precalculation for indirect conv2d - OpenCL | Gian Marco Iodice |
| 2022-11-14 | Optimize Transposed Convolution for CL backend (FP32/16) | Gunes Bayir |
| 2022-11-01 | Rework direct convolution heuristic on OpenCL | Gian Marco Iodice |
| 2022-10-07 | Workaround CL compiler issue on FP16 | Viet-Hoa Do |
| 2022-10-06 | Rework DepthwiseConvolution heuristic on OpenCL | Gian Marco Iodice |
| 2022-09-07 | Optimize depthwise convolution on OpenCL | Gian Marco Iodice |
| 2022-07-21 | Fix direct convolution cases that were failing on Odroid | Adnan AlSinan |
| 2022-06-27 | Implement new Elementwise Dynamic Fusion Operators: Div, Floor | Michalis Spyrou |
| 2022-06-15 | Fix performance regression in Winograd Output Transform (OpenCL) | Gian Marco Iodice |
| 2022-04-19 | Add CLPool3d Int8 Support | Mohammed Suhail Munshi |
| 2022-03-15 | Implementation of ClPooling3d | ramelg01 |
| 2022-02-10 | Improve start-up time for winograd_output_transform_*_nhwc | ramelg01 |
| 2022-02-09 | Remove deprecated remap functions. | Adnan AlSinan |
| 2022-02-09 | Improve start-up time for winograd_input_transform_*_nhwc | ramelg01 |
| 2022-02-08 | Improve start-up time for winograd_filter_transform_*_nhwc | ramelg01 |
| 2021-12-10 | Use #if directive instead of regular condition in CLDirectConv2D | Giorgio Arena |
| 2021-12-01 | Improve start-up direct convolution on OpenCL | Gian Marco Iodice |
| 2021-11-29 | Use loop unrolling only when the kernel height is less than 5 | Gian Marco Iodice |
| 2021-11-17 | Improve start-up timer for ClIm2Col | Giorgio Arena |
| 2021-11-17 | Improve start-up time for depthwise convolution | Sheri Zhang |
| 2021-11-09 | Improve start-up time for ClScale | Adnan AlSinan |
| 2021-10-20 | Implement CLDirectConv3DKernel - uint8/int8 | Giorgio Arena |
| 2021-10-15 | Fix CLConv3D filelist and comments | Giorgio Arena |
| 2021-10-14 | Implement CLDirectConv3D f32/f16 | Giorgio Arena |
| 2021-09-14 | Optimize ClScaleKernel on NHWC (f32/f16/int8) | Gian Marco Iodice |
| 2021-08-23 | Remove padding from ClScaleKernel | Giorgio Arena |
| 2021-07-25 | Reorganize the kernels into nhwc, nchw and common folders | Adnan AlSinan |