path: root/src/core
Age | Commit message | Author
2021-09-29 | Add support for non-constant weights and biases in CpuFullyConnected | Giorgio Arena
2021-09-23 | Fix inefficient store in gemmlowp_mm_reshaped_only_rhs_t | Gian Marco Iodice
2021-09-22 | Update OpenCL header file to version 2020.12.18 | Sheri Zhang
2021-09-16 | Revert "Add support for non-constant weights and biases in CpuFullyConnected" | Pablo Marquez Tello
2021-09-15 | Adds Conv3d reference implementation support. | Adnan AlSinan
2021-09-15 | Update cpu heuristics for GEMM | Gunes Bayir
2021-09-14 | Optimize ClScaleKernel on NHWC (f32/f16/int8) | Gian Marco Iodice
2021-09-09 | Remove padding from ClGemmMatrixMultiplyReshapedOnlyRhsKernel | Giorgio Arena
2021-09-08 | Fix vload_partial macros on OpenCL | Giorgio Arena
2021-09-07 | Remove padding from ClGemmMatrixMultiplyReshapedKernel | Giorgio Arena
2021-09-07 | Add support for non-constant weights and biases in CpuFullyConnected | Michele Di Giorgio
2021-09-07 | Update cpu depthwise kernels | Freddie Liardet
2021-09-06 | Revert "Remove padding from ClGemmMatrixMultiplyReshapedKernel" | Pablo Marquez Tello
2021-09-03 | Remove padding from ClPool2dKernel NCHW | Giorgio Arena
2021-09-03 | Fix CLNormalizationLayer NCHW border calculation | SiCongLi
2021-09-01 | Remove padding from ClGemmMatrixMultiplyReshapedKernel | Michele Di Giorgio
2021-09-01 | Fix strict overflow warnings | Michalis Spyrou
2021-08-25 | Move CPU/GPU files from Core/Runtime to the respective backend folders | Georgios Pinitas
2021-08-24 | Re-use auxiliary memory withing CpuWinogradConv2d operators | Georgios Pinitas
2021-08-23 | Remove padding from ClScaleKernel | Giorgio Arena
2021-08-13 | Avoid releasing weights if they are used by multiple functions | Georgios Pinitas
2021-08-04 | Fix depthwise convolution assembly kernels | Freddie Liardet
2021-08-04 | Avoid over-allocation of temporary buffers within CpuWinogradConv2d | Georgios Pinitas
2021-08-02 | Add missing limits include | Freddie Liardet
2021-08-02 | Benchmark and set default LWS for GEMM, Direct convolution and Winograd | Giorgio Arena
2021-07-30 | Compilation issue: neon=1 armv8.2 on Android with NDKr18beta1 | Gian Marco Iodice
2021-07-29 | Fix A55 performance constant for fp16 hybrid gemm kernel | Georgios Pinitas
2021-07-29 | Port NEConvolutionLayer | Michalis Spyrou
2021-07-28 | Create custom flags for enabling fp16 support | Georgios Pinitas
2021-07-28 | Reduce binary footprint of CpuConvertFullyConnectedWeightsKernel | Michele Di Giorgio
2021-07-28 | Fix bare metal build issues | Freddie Liardet
2021-07-28 | Fix cpu GEMM fp16 issue | Freddie Liardet
2021-07-28 | Reorganize the kernels into nhwc, nchw and common folders | Adnan AlSinan
2021-07-28 | Remove generated kernels that overlap hand-written ones | Georgios Pinitas
2021-07-27 | Port CLGEMMConvolutionLayer | Manuel Bottini
2021-07-27 | Update GEMM assembly performance parameters | Georgios Pinitas
2021-07-26 | Add missing limits include | Freddie Liardet
2021-07-25 | Reorganize the kernels into nhwc, nchw and common folders | Adnan AlSinan
2021-07-23 | Fix vector_length identification mechanism for SVE | Georgios Pinitas
2021-07-23 | Port NEFullyConnectedLayer to memory injecting interface | Michele Di Giorgio
2021-07-23 | Fix bare metal build error | Freddie Liardet
2021-07-22 | Fix oclgrind int overflow warning | Freddie Liardet
2021-07-22 | Update GEMM assembly kernels | Georgios Pinitas
2021-07-16 | Include limits header to fix errors reported by GCC11 | Michele Di Giorgio
2021-07-16 | Avoid multiple Rhs matrix transformation on ClGemm | Georgios Pinitas
2021-07-16 | Port CLIm2ColKernel to ClIm2ColKernel | Manuel Bottini
2021-07-15 | Port NEGEMMConvolutionLayer | Manuel Bottini
2021-07-15 | Improve filelist for GPU | Georgios Pinitas
2021-07-15 | Port CLCol2ImKernel to ClCol2ImKernel | Manuel Bottini
2021-07-14 | Fix CL kernel compilation failure | Michalis Spyrou