path: root/src/gpu/cl/operators
Age  Commit message  Author
2022-12-29  Update the ClConv2d heuristic  Gian Marco Iodice
2022-12-14  Optimize Transposed Convolution for CL backend (Quantized)  Gunes Bayir
2022-12-12  Fix build error resulting from incorrect header path  Jakub Sujak
2022-12-09  Use heuristics for setting dynamic fusion direct conv2d tile sizes  Ramy Elgammal
2022-12-09  Implement the OpenCL kernel to compute the indirect convolution  Gian Marco Iodice
2022-11-22  Remove dynamic fusion prototype with tests and examples  SiCong Li
2022-11-14  Optimize Transposed Convolution for CL backend (FP32/16)  Gunes Bayir
2022-09-09  Rework heuristic in ClConv2d  Gian Marco Iodice
2022-09-02  Enable Winograd-based conv2d when IFM>=8 on Gpu  Gian Marco Iodice
2022-08-17  Revert "Fix performance regression in ClConv2D"  Ramy Elgammal
2022-08-16  Fix performance regression in ClConv2D  Gian Marco Iodice
2022-08-11  Fix performance regression in Conv2D on OpenCL  Adnan AlSinan
2022-08-11  Disable unsafe FP optimizations in Winograd Output Transform  Gunes Bayir
2022-08-05  Fix LeNet-f16 convolution regression  Adnan AlSinan
2022-07-22  Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED  Freddie Liardet
2022-07-22  Update ClConv2D heuristic to use direct convolution  Adnan AlSinan
2022-07-13  Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16  Gunes Bayir
2022-07-08  Extended direct conv 2d interface for tuning the OpenCl kernel  Gian Marco Iodice
2022-05-11  Fix inclusion guard for dynamic fusion module  SiCong Li
2022-05-06  Integrate Dynamic Fusion patches  SiCong Li
2022-03-15  Implementation of ClPooling3d  ramelg01
2022-02-21  Fix performance regression on Arm(R) Mali(TM)-G71  Gian Marco Iodice
2022-02-10  Fix performance regression on the first layer of convolution-based model  Gian Marco Iodice
2022-02-02  Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"  Ramy Elgammal
2022-01-25  Rework gemm_mm_reshaped_only_rhs_ kernels with new macros  Gian Marco Iodice
2022-01-21  Fix heuristic in ClConv2D  Gian Marco Iodice
2021-12-13  Remove padding from ClDirectConv2dKernel  Adnan AlSinan
2021-12-10  Use #if directive instead of regular condition in CLDirectConv2D  Giorgio Arena
2021-12-01  Improve start-up direct convolution on OpenCL  Gian Marco Iodice
2021-11-20  Improve start-up timer for GeMM (floating-point):  ramelg01
2021-11-04  Add validate tests for CLConvolutionLayer and CLGEMMConvolutionLayer with pos...  SiCongLi
2021-11-02  Add post ops to ClGemmMatrixMultiplyReshapedOnlyRHSKernel and ClGemmMatrixMul...  SiCongLi
2021-11-01  Add PostOp support to GEMM and CLGEMM operators and functions Part 2  SiCongLi
2021-10-20  Implement CLDirectConv3DKernel - uint8/int8  Giorgio Arena
2021-10-18  DirectConv3d support refine  Sheri Zhang
2021-10-18  Remove legacy GeMM kernels on OpenCL  Gian Marco Iodice
2021-10-14  Implement CLDirectConv3D f32/f16  Giorgio Arena
2021-10-01  Provide logging for configure functions in all CL functions  ramelg01
2021-09-29  Add support for non-constant weights and biases in CpuFullyConnected  Giorgio Arena
2021-09-28  Provide logging for configure functions in all gpu operators  ramelg01
2021-09-16  Revert "Add support for non-constant weights and biases in CpuFullyConnected"  Pablo Marquez Tello
2021-09-07  Add support for non-constant weights and biases in CpuFullyConnected  Michele Di Giorgio
2021-09-03  Remove padding from ClPool2dKernel NCHW  Giorgio Arena
2021-08-25  Move CPU/GPU files from Core/Runtime to the respective backend folders  Georgios Pinitas