aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2021-06-18Integrate improved CPU depthwise convolution kernelsMichele Di Giorgio
2021-06-18Remove implementation headers from NESoftmaxLayer public headerGeorgios Pinitas
2021-06-15Add CPU discovery capabilities.Georgios Pinitas
2021-06-15Port CLWinogradConvolutionLayer with ClWinogradConv2dManuel Bottini
2021-06-15Fix incorrect memory handling in ported functionsManuel Bottini
2021-06-15Add NHWC support to CLRemapFrederick Liardet
2021-06-11Fix errata in documentationJakub Sujak
2021-06-09Fixed segfault in NEGEMMConv2dPablo Tello
2021-06-08Add guards on SVE kernelsMichalis Spyrou
2021-06-07Revert "Implement memory injection in CpuDirectGemmConv2d"Michele Di Giorgio
2021-06-07Revert "Add optimization for global pooling in pooling_layer.cl"Pablo Tello
2021-06-07Enable fat binary supportGeorgios Pinitas
2021-06-07Fix WeightRetention tests on multiple calls of FullyConnectedGeorgios Pinitas
2021-06-02Fix bug in PReluLayer when input is 1xN sizeFreddie Liardet
2021-06-02Fixed the compiler warning -Werror=type-limitsPablo Marquez Tello
2021-06-01Fuse activation in ClDirectConv2dKernel for float typesGeorgios Pinitas
2021-06-01Rename ported functionsManuel Bottini
2021-06-01Optimize int8 arithmetic addition on CPUGiorgio Arena
2021-05-27Implement memory injection in CpuDirectGemmConv2dSang-Hoon Park
2021-05-26Fix node fusion mutatorSiCongLi
2021-05-26DirectConvolutionLayer create image failureManuel Bottini
2021-05-26Create CpuGemmDirectConv2dSang-Hoon Park
2021-05-25Remove used auxilary tensors from ClGemm's tensor packSang-Hoon Park
2021-05-25Fix retrieving device architecture twice in CLCompileContextGiorgio Arena
2021-05-20Add support for dynamic weights in CL FullyConnected layerGiorgio Arena
2021-05-20Enable unroll through pragma based on DDK versionGiorgio Arena
2021-05-19clCreateKernel failure of CL/ChannelShuffle/U8/Manuel Bottini
2021-05-19Port DepthConvert to new ApiGeorgios Pinitas
2021-05-18Port CLGEMM to memory injecting interfaceGeorgios Pinitas
2021-05-18Remove padding from NERemapKernelManuel Bottini
2021-05-18Port CLFlattenLayer to a memory injecting interfaceGeorgios Pinitas
2021-05-18Port NEFlatten layer to a memory injecting interfaceGeorgios Pinitas
2021-05-18Fix heuristic in Int8 CLDirectConvolutionKernelGian Marco Iodice
2021-05-17Update CPU identification codesGeorgios Pinitas
2021-05-17Add macro to manually unroll loops in OpenCLGiorgio Arena
2021-05-17Move memory management out of CpuPoolingMichele Di Giorgio
2021-05-17Fix oclgrind error on CLGEMMLowp reshaped only RHS quantized per channelGiorgio Arena
2021-05-17Fix MeanStdDevNormalizationLayer reference outputting nan for FP16Giorgio Arena
2021-05-14Fix the OpenCL convolution layer heuristic for int8Gian Marco Iodice
2021-05-14Rename NEGEMMAssembly to CpuGemmAssemblySang-Hoon Park
2021-05-13Fix Macros.h include in NEScaleMichele Di Giorgio
2021-05-13Fix integer overflow and null dereferenceMichalis Spyrou
2021-05-13Remove padding from CLChannelShuffleLayerKernelManuel Bottini
2021-05-13Fix Pooling Layer Bug when input is 1xN sizeFreddie Liardet
2021-05-12Fix GEMMLowp output stage validation crash when input's first dimension == 1Giorgio Arena
2021-05-12Fix bug in Select operator when input is 1xNFreddie Liardet
2021-05-12Fix 'ARM_DOT_K0XN0' macro redefinedGiorgio Arena
2021-05-12Suppress unused variable warning that is present on empty backend buildsGeorgios Pinitas
2021-05-12Remove unused CLCoreRuntimeContextGeorgios Pinitas
2021-05-11Adding S32 Support to NEG operator in CLElementwiseUnaryLayerSuhail Munshi