aboutsummaryrefslogtreecommitdiff
path: root/src/runtime
AgeCommit message (Expand)Author
2021-08-18Enable fast_math on CpuGemmConvolutionv21.08branches/arm_compute_21_08Georgios Pinitas
2021-08-18Update the heuristic to call direct convolution in clConv2dGian Marco Iodice
2021-08-18Retain weights in ClGemm when reconfiguring the operator with retentionGeorgios Pinitas
2021-08-13Avoid releasing weights if they are used by multiple functionsGeorgios Pinitas
2021-08-13Ensure correct transformed matrices are used in CpuGemmConvolutionGeorgios Pinitas
2021-08-12Ensure that correct transformed matrices are used in CpuFullyConnectedGeorgios Pinitas
2021-08-11Fix performance regression due to clFinish()Gian Marco Iodice
2021-08-10Fix compiler error in CLActivationLayerPablo Marquez Tello
2021-08-09Fix compiler error in GCC 7.4 + Ubuntu 16Pablo Marquez Tello
2021-08-04Remove 21.08 deprecated codeFreddie Liardet
2021-08-04Report error for unsupported non-constant weights in CpuFullyConnectedMichele Di Giorgio
2021-08-04Avoid over-allocation of temporary buffers within CpuWinogradConv2dGeorgios Pinitas
2021-08-04Implement Operator APIGeorgios Pinitas
2021-08-02Port CLConvolutionLayerSheri Zhang
2021-07-30Port ClFullyConnected to new APIGeorgios Pinitas
2021-07-30Reintroduce implementation of NEConvolutionLayer::get_convolution_methodMichele Di Giorgio
2021-07-29Port NEConvolutionLayerMichalis Spyrou
2021-07-28Reduce binary footprint of CpuConvertFullyConnectedWeightsKernelMichele Di Giorgio
2021-07-27Fix memory lifetime issueGeorgios Pinitas
2021-07-27Port CLGEMMConvolutionLayerManuel Bottini
2021-07-27Dispatch Conv2d using the Direct method when necessaryGeorgios Pinitas
2021-07-26Fix allocation of prepare tensor on ClWinogradConv2dGeorgios Pinitas
2021-07-23Avoid allocation of auxiliary memory in CpuGemmConvolutionGeorgios Pinitas
2021-07-23Port NEFullyConnectedLayer to memory injecting interfaceMichele Di Giorgio
2021-07-23Pass fast math flag for correct GEMM3D validation supportGeorgios Pinitas
2021-07-22Expose fast_math mode for GEMM through BFloat16Georgios Pinitas
2021-07-22Inject temporary tensors to pack in they don't exist in CpuSoftmaxGeorgios Pinitas
2021-07-22Port ClGemmLowp to new APIGeorgios Pinitas
2021-07-22Update GEMM assembly kernelsGeorgios Pinitas
2021-07-16Avoid multiple Rhs matrix transformation on ClGemmGeorgios Pinitas
2021-07-16Port CLIm2ColKernel to ClIm2ColKernelManuel Bottini
2021-07-15Disabled DirectConv method for NCHW and kernel > 15Pablo Marquez Tello
2021-07-15Port NEGEMMConvolutionLayerManuel Bottini
2021-07-15Port CLCol2ImKernel to ClCol2ImKernelManuel Bottini
2021-07-13Port NEWinogradConvolutionLayerMichalis Spyrou
2021-07-09Port NECol2ImKernelManuel Bottini
2021-07-08Remove redundant implementations of Add/Sub operatorsGeorgios Pinitas
2021-07-08Change CLConvolution selection to run Direct approach on large kernelsGeorgios Pinitas
2021-07-08Port NEGEMMLowp Part 2Manuel Bottini
2021-07-06Port NEIm2ColKernelManuel Bottini
2021-07-02Rework OpenCL Depthwise ConvolutionGian Marco Iodice
2021-07-02Align kernel/operator header layoutGeorgios Pinitas
2021-07-02Port NEGEMM to memory injecting interface (Part 3)Michele Di Giorgio
2021-07-02Implement FP GPU depthwise convolution 1x1 kernel for in-place computationSiCongLi
2021-06-30Port ClGemmLowpOutputStage operator to new interfaceGeorgios Pinitas
2021-06-30Revert "Rework OpenCL Depthwise Convolution"Gian Marco Iodice
2021-06-29Port the ClGemmLowp kernels to the new APIGeorgios Pinitas
2021-06-29Port NEGEMM to memory injecting interface (Part 2)Michele Di Giorgio
2021-06-29Port NEGEMM to memory injecting interface (Part 1)Michele Di Giorgio
2021-06-25Port NEGEMMConv2d to memory injecting interfaceMichele Di Giorgio