aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2023-03-29Fix GCC13 compiler errorsPablo Marquez Tello
2023-03-29Add quantized support for unary elementwise in CPUViet-Hoa Do
2023-03-27Fix BatchToSpaceFixtureSiCong Li
2023-03-24Add Texture Pipe Support for Matmul Lhs T/NT Rhs T kernelsRamy Elgammal
2023-03-24Work around CLScale compiler-specific issueSiCong Li
2023-03-24Add Texture Pipe Support for Matmul Lhs T/NT Rhs NT kernelsGunes Bayir
2023-03-23Round to nearest with ties to away from zero in ReluPablo Marquez Tello
2023-03-21gemm_interleaved: Set up the accumulation buffer properly in alternateDavid Mansell
2023-03-21Add dynamic weights for CPU fully connected layerViet-Hoa Do
2023-03-20Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16Gunes Bayir
2023-03-17Implementation of RSQRT for quantized int8Ramy Elgammal
2023-03-17Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16Ramy Elgammal
2023-03-14Add CropInfo to BatchToSpace reference and fixtureSiCong Li
2023-03-13arm_gemm: Add SME2 FP16 kernels.David Mansell
2023-03-13[ONCPUML-1174] Allow src/weights mismatch for fixed formatJonathan Deakin
2023-03-08Add support for arbitrary parameters for CPU GatherViet-Hoa Do
2023-03-07Add sigmoid and tanh for dynamic fusionViet-Hoa Do
2023-03-07GEMM: SME: Allow threading for quantized GEMMs.David Mansell
2023-03-07Resolve the presence of variables that are unused in release mode in Dynamic ...Omar Al Khatib
2023-03-06Fix LWS search space used by CLTunerSiCong Li
2023-03-03Add weights_info as optional input for NEDeconvolutionLayerAnnop Wongwathanarat
2023-03-03NEGEMMLowpMatrixMultiplyCore should be configured for optimized int8 kernel.Ethan Doe
2023-03-02Fix direct conv2d in dynamic fusionViet-Hoa Do
2023-03-01Add support for kernel indices in MaxpoolAdnan AlSinan
2023-02-28Add an option to use lowest for max-poolingAdnan AlSinan
2023-02-27Add build option to disable threads hintViet-Hoa Do
2023-02-22Fix configuration files required for Bazel and CMake buildsGunes Bayir
2023-02-15Fix Intermittent Neon™ ReduceMean QASYMM8 MismatchMohammed Suhail Munshi
2023-02-14Extend skip upsampling for deconvolution for non-1x1 kernelsAnnop Wongwathanarat
2023-02-09Fix performance regression in Transposed ConvolutionGunes Bayir
2023-02-08Update CPU kernels to remove x19 and w19Michael Tyler
2023-02-08Add support for dilation > 1 in assembly DepthwiseConvolutionPablo Marquez Tello
2023-02-03Fix armv7a failing GEMMConvolutionLayer testsMohammed Suhail Munshi
2023-02-01Fix GEMMLowp/Batched MatMul mismatches on CPUMohammed Suhail Munshi
2023-02-01Add new operator AddMulAdd for Neon™ backend for Float/Quantized typesGunes Bayir
2023-02-01Add Subtraction operator to Dynamic Fusion interfaceRamy Elgammal
2023-02-01Remove fixed format strides hackJonathan Deakin
2023-01-31Fixed clang-cl linker errorsPablo Tello
2023-01-31Add Multiplication operator (FP only) to Dynamic Fusion InterfaceJakub Sujak
2023-01-31Bazel and CMake buildsDavid Svantesson
2023-01-30Skip upsampling for deconvolution when not neededAnnop Wongwathanarat
2023-01-26Fix num_threads_hint() on macos.Pablo Marquez Tello
2023-01-25Implement dynamic fusion softmax operatorRamy Elgammal
2023-01-24Change dynamic fusion API to return destination tensor infoGunes Bayir
2023-01-23Fix performance regression when stride equal to kernel sizeGunes Bayir
2023-01-20Add missing direct conv2d tests to dynamic fusionSiCong Li
2023-01-20Improve the strided_slice layer on all data typesOmar Al Khatib
2023-01-20Add Auxiliary tensorsSiCong Li
2023-01-20Dynamic Fusion Pooling Layer 2dMohammed Suhail Munshi
2023-01-20Add enable_fast_math for NEDeconvolutionLayerAnnop Wongwathanarat