aboutsummaryrefslogtreecommitdiff
path: root/src/runtime/CL/functions
AgeCommit message (Expand)Author
2023-03-14Add CropInfo to BatchToSpace reference and fixtureSiCong Li
2023-02-09Fix performance regression in Transposed ConvolutionGunes Bayir
2023-01-23Fix performance regression when stride equal to kernel sizeGunes Bayir
2022-12-30Move DWC native heuristic into the heuristic folderGian Marco Iodice
2022-12-29Update the ClConv2d heuristicGian Marco Iodice
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-10-06Rework DepthwiseConvolution heuristic on OpenCLGian Marco Iodice
2022-09-07Optimize depthwise convolution on OpenCLGian Marco Iodice
2022-08-24Fix validation problem in CLQLSTMLayerPablo Marquez Tello
2022-08-11Fix CTS/SLTS failure related to Depthwise ConvolutionGunes Bayir
2022-07-13Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16Gunes Bayir
2022-03-15Implementation of ClPooling3dramelg01
2022-02-16Fixed threshould argument order in NE/CL/LSTMPablo Marquez Tello
2022-02-09Remove deprecated remap functions.Adnan AlSinan
2021-11-12Fix PostOp dependencySiCongLi
2021-11-01Add PostOp support to GEMM and CLGEMM operators and functions Part 2SiCongLi
2021-10-28Add experimental PostOp interface to ClGemmMatrixMultiplyReshapedKernel Part 1SiCongLi
2021-10-14Implement CLDirectConv3D f32/f16Giorgio Arena
2021-10-07Per-operator build dependenciesGeorgios Pinitas
2021-10-04Provide logging for configure functions in all CPP functionsramelg01
2021-10-01Provide logging for configure functions in all CL functionsramelg01
2021-08-25Move CPU/GPU files from Core/Runtime to the respective backend foldersGeorgios Pinitas
2021-08-20Rename [Cl|Cpu]GemmConvolution to [Cl|Gpu]GemmConv2dGeorgios Pinitas
2021-08-19Address comments on avoiding releasing weights if used by multiple functionsGiorgio Arena
2021-08-18Retain weights in ClGemm when reconfiguring the operator with retentionGeorgios Pinitas
2021-08-13Avoid releasing weights if they are used by multiple functionsGeorgios Pinitas
2021-08-02Port CLConvolutionLayerSheri Zhang
2021-07-30Port ClFullyConnected to new APIGeorgios Pinitas
2021-07-27Fix memory lifetime issueGeorgios Pinitas
2021-07-27Port CLGEMMConvolutionLayerManuel Bottini
2021-07-27Dispatch Conv2d using the Direct method when necessaryGeorgios Pinitas
2021-07-22Port ClGemmLowp to new APIGeorgios Pinitas
2021-07-22Update GEMM assembly kernelsGeorgios Pinitas
2021-07-16Avoid multiple Rhs matrix transformation on ClGemmGeorgios Pinitas
2021-07-16Port CLIm2ColKernel to ClIm2ColKernelManuel Bottini
2021-07-15Port CLCol2ImKernel to ClCol2ImKernelManuel Bottini
2021-07-08Change CLConvolution selection to run Direct approach on large kernelsGeorgios Pinitas
2021-07-02Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-07-02Implement FP GPU depthwise convolution 1x1 kernel for in-place computationSiCongLi
2021-06-30Port ClGemmLowpOutputStage operator to new interfaceGeorgios Pinitas
2021-06-30Revert "Rework OpenCL Depthwise Convolution"Gian Marco Iodice
2021-06-29Port the ClGemmLowp kernels to the new APIGeorgios Pinitas
2021-06-25Port NEGEMMConv2d to memory injecting interfaceMichele Di Giorgio
2021-06-24Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-06-22Fix Winograd heuristic in CLConvolutionLayerGian Marco Iodice
2021-06-22Add FP16 support to CLRemapFreddie Liardet
2021-06-21Rework the CLConvolutionLayer heuristicGian Marco Iodice
2021-06-18Integrate improved CPU depthwise convolution kernelsMichele Di Giorgio