path: root/src/core/CL/cl_kernels/nhwc
Age | Commit message | Author
2022-11-01 | Rework direct convolution heuristic on OpenCL | Gian Marco Iodice
2022-10-07 | Workaround CL compiler issue on FP16 | Viet-Hoa Do
2022-10-06 | Rework DepthwiseConvolution heuristic on OpenCL | Gian Marco Iodice
2022-09-07 | Optimize depthwise convolution on OpenCL | Gian Marco Iodice
2022-07-21 | Fix direct convolution cases that were failing on Odroid | Adnan AlSinan
2022-06-27 | Implement new Elementwise Dynamic Fusion Operators: Div, Floor | Michalis Spyrou
2022-06-15 | Fix performance regression in Winograd Output Transform (OpenCL) | Gian Marco Iodice
2022-04-19 | Add CLPool3d Int8 Support | Mohammed Suhail Munshi
2022-03-15 | Implementation of ClPooling3d | ramelg01
2022-02-10 | Improve start-up time for winograd_output_transform_*_nhwc | ramelg01
2022-02-09 | Remove deprecated remap functions. | Adnan AlSinan
2022-02-09 | Improve start-up time for winograd_input_transform_*_nhwc | ramelg01
2022-02-08 | Improve start-up time for winograd_filter_transform_*_nhwc | ramelg01
2021-12-10 | Use #if directive instead of regular condition in CLDirectConv2D | Giorgio Arena
2021-12-01 | Improve start-up direct convolution on OpenCL | Gian Marco Iodice
2021-11-29 | Use loop unrolling only when the kernel height is less than 5 | Gian Marco Iodice
2021-11-17 | Improve start-up timer for ClIm2Col | Giorgio Arena
2021-11-17 | Improve start-up time for depthwise convolution | Sheri Zhang
2021-11-09 | Improve start-up time for ClScale | Adnan AlSinan
2021-10-20 | Implement CLDirectConv3DKernel - uint8/int8 | Giorgio Arena
2021-10-15 | Fix CLConv3D filelist and comments | Giorgio Arena
2021-10-14 | Implement CLDirectConv3D f32/f16 | Giorgio Arena
2021-09-14 | Optimize ClScaleKernel on NHWC (f32/f16/int8) | Gian Marco Iodice
2021-08-23 | Remove padding from ClScaleKernel | Giorgio Arena
2021-07-25 | Reorganize the kernels into nhwc, nchw and common folders | Adnan AlSinan