aboutsummaryrefslogtreecommitdiff
path: root/src/runtime/gpu
AgeCommit message (Expand)Author
2021-08-25Move CPU/GPU files from Core/Runtime to the respective backend foldersGeorgios Pinitas
2021-08-23Remove padding from ClScaleKernelGiorgio Arena
2021-08-20Rename [Cl|Cpu]GemmConvolution to [Cl|Gpu]GemmConv2dGeorgios Pinitas
2021-08-18Update the heuristic to call direct convolution in clConv2dGian Marco Iodice
2021-08-18Retain weights in ClGemm when reconfiguring the operator with retentionGeorgios Pinitas
2021-08-11Fix performance regression due to clFinish()Gian Marco Iodice
2021-08-10Fix compiler error in CLActivationLayerPablo Marquez Tello
2021-08-04Implement Operator APIGeorgios Pinitas
2021-08-02Port CLConvolutionLayerSheri Zhang
2021-07-30Port ClFullyConnected to new APIGeorgios Pinitas
2021-07-27Fix memory lifetime issueGeorgios Pinitas
2021-07-27Port CLGEMMConvolutionLayerManuel Bottini
2021-07-26Fix allocation of prepare tensor on ClWinogradConv2dGeorgios Pinitas
2021-07-22Port ClGemmLowp to new APIGeorgios Pinitas
2021-07-16Avoid multiple Rhs matrix transformation on ClGemmGeorgios Pinitas
2021-07-08Port NEGEMMLowp Part 2Manuel Bottini
2021-07-02Align kernel/operator header layoutGeorgios Pinitas
2021-06-30Port ClGemmLowpOutputStage operator to new interfaceGeorgios Pinitas
2021-06-15Port CLWinogradConvolutionLayer with ClWinogradConv2dManuel Bottini
2021-06-15Fix incorrect memory handling in ported functionsManuel Bottini
2021-06-01Fuse activation in ClDirectConv2dKernel for float typesGeorgios Pinitas
2021-06-01Rename ported functionsManuel Bottini
2021-05-25Remove used auxilary tensors from ClGemm's tensor packSang-Hoon Park
2021-05-20Add support for dynamic weights in CL FullyConnected layerGiorgio Arena
2021-05-19Port DepthConvert to new ApiGeorgios Pinitas
2021-05-18Port CLGEMM to memory injecting interfaceGeorgios Pinitas
2021-05-18Port CLFlattenLayer to a memory injecting interfaceGeorgios Pinitas
2021-05-05Rename Quantization/Dequantization kernels/operators to imperative moodGeorgios Pinitas
2021-05-04Rename PixelwiseMultiplications to Mul for simplicityGeorgios Pinitas
2021-04-20Remove OpenCL padding: CLPixelWiseMultiplicationKernelGiorgio Arena
2021-04-13Port CLConvertFullyConnectedWeights to new APITeresa Charlin
2021-03-29Port ClTranspose to new APITeresa Charlin
2021-03-23Make ClDirectConvolutionKernel statelessSheri Zhang
2021-03-23Make ClPixelWiseMultiplicationKernel statelessSheri Zhang
2021-03-12Port OpenCL Scale to new APIManuel Bottini
2021-03-11Port OpenCL Dequantization to new APIManuel Bottini
2021-03-10Port OpenCL Quantization to new APIManuel Bottini
2021-03-08Make Softmax kernels on OpenCL statelessSang-Hoon Park
2021-03-03Create ClPRelu operatorSang-Hoon Park
2021-02-08Make memset/copy functions state-lessSheri Zhang
2021-02-03Make CL Pooling kernels and functions state-lessMichele Di Giorgio
2021-01-27Make CL Elementwise Unary kernels and functions state-lessMichele Di Giorgio
2021-01-27Make Gpu Elementwise Operations kernels and functions state-lessMichele Di Giorgio
2021-01-26Make CLArithmeticSubtraction kernel and function state-lessMichele Di Giorgio
2021-01-26Make CLArithmeticAddition kernel and function state-lessMichele Di Giorgio
2021-01-21Make CLFloor and CLActivation kernels and functions state-lessGeorgios Pinitas
2021-01-20Make all CL Concatenate kernels and functions state-lessMichele Di Giorgio