aboutsummaryrefslogtreecommitdiff
path: root/src/core/CL/cl_kernels/tile_helpers.h
AgeCommit message (Expand)Author
2023-07-06Fix nightly failures in MatMulLowpNativeKernel when using bounded activation ...Mohammed Suhail Munshi
2023-05-11Fix invalid vector length in CLViet-Hoa Do
2023-04-17Add quantized CL MatMul kernels for Lhs NT/T, Rhs NTGunes Bayir
2023-03-20Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16Gunes Bayir
2023-03-17Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16Ramy Elgammal
2023-02-01Add Subtraction operator to Dynamic Fusion interfaceRamy Elgammal
2023-01-31Add Multiplication operator (FP only) to Dynamic Fusion InterfaceJakub Sujak
2023-01-10Extend cl image support to input and output tensorsGian Marco Iodice
2023-01-06LHS broadcasting addition for dynamic fusionViet-Hoa Do
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-29Adding GpuAdd to dynamic fusion operatorsRamy Elgammal
2022-11-14Optimize T_QUANTIZE8_ASYMMETRIC for Maliā„¢ G52Pablo Marquez Tello
2022-11-01Rework direct convolution heuristic on OpenCLGian Marco Iodice
2022-07-13Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16Gunes Bayir
2022-06-27Implement new Elementwise Dynamic Fusion Operators: Div, FloorMichalis Spyrou
2022-05-31Add cl_khr_integer_dot_product extension supportViet-Hoa Do
2022-05-09Mismatches in dynamically fused direct conv2d + add kernelMichalis Spyrou
2022-04-14Include missing embedded headersSiCong Li
2022-02-02Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"Ramy Elgammal
2022-01-25Rework gemm_mm_reshaped_only_rhs_ kernels with new macrosGian Marco Iodice
2021-12-23Rework gemm_reshape_lhs_ with new macrosAdnan AlSinan
2021-11-26Rework gemm_reshape_rhs_(nt,t) with new macrosGian Marco Iodice
2021-11-09Improve start-up time for ClScaleAdnan AlSinan
2021-10-14Implement CLDirectConv3D f32/f16Giorgio Arena
2021-07-22Fix oclgrind int overflow warningFreddie Liardet
2021-07-14Fix CL kernel compilation failureMichalis Spyrou
2021-07-02Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-07-01Add quantization helper functions for OpenCLGeorgios Pinitas
2021-06-30Revert "Rework OpenCL Depthwise Convolution"Gian Marco Iodice
2021-06-24Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-05-20Enable unroll through pragma based on DDK versionGiorgio Arena
2021-05-17Add macro to manually unroll loops in OpenCLGiorgio Arena
2021-05-07Fix missing DATA_TYPE in DOT_PRODUCT4_INTEGER8 OpenCL macroGian Marco Iodice
2021-04-20Remove OpenCL padding: CLPixelWiseMultiplicationKernelGiorgio Arena
2021-04-12Add support for cl_image in CLDirectConvolutionLayerGian Marco Iodice
2021-04-08Rework the OpenCL Winograd Input Transformations NHWCGian Marco Iodice
2021-03-25Improve performance of Winograd Output Transform 3x3Gian Marco Iodice
2021-03-23Extend direct convolution (F32/F16/QASYMM8)Gian Marco Iodice