aboutsummaryrefslogtreecommitdiff
path: root/src/core
AgeCommit message (Expand)Author
2023-01-09Add extend padding lock flagRamy Elgammal
2023-01-06LHS broadcasting addition for dynamic fusionViet-Hoa Do
2022-12-29Optimize CL Scale/Resize Quantized by removing (de)quant. codeGunes Bayir
2022-12-29Extend Transposed Conv. for tiles with N0>1Gunes Bayir
2022-12-28Fix company name on copyright noticeViet-Hoa Do
2022-12-28Fix various compilation errorsViet-Hoa Do
2022-12-23Make CLReshape kernel window based on dst instead of srcRamy Elgammal
2022-12-21Optimize MeanReduce by integer acc. and removing upfront dequant.Omar Al Khatib
2022-12-21Update direct conv2d kernel in dynamic fusionGian Marco Iodice
2022-12-21Optimize SVE natural exponential functionViet-Hoa Do
2022-12-21Fixed various mismatches in CpuCastKernelPablo Marquez Tello
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-09Optimize CPU base-e exponential function on FP32Viet-Hoa Do
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-29Adding GpuAdd to dynamic fusion operatorsRamy Elgammal
2022-11-28Integrate SME2 kernelsViet-Hoa Do
2022-11-25Implement address precalculation for indirect conv2d - OpenCLGian Marco Iodice
2022-11-22Remove dynamic fusion prototype with tests and examplesSiCong Li
2022-11-15Fixed Arm NN unit test failure caused by quantised multiplication patch.Omar Al Khatib
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-11-14Optimize T_QUANTIZE8_ASYMMETRIC for Mali™ G52Pablo Marquez Tello
2022-11-10Fix compiler warnings in dynamic fusionSiCong Li
2022-11-07Optimize CPU mul layer on quantized dataOmar Al Khatib
2022-11-03Fix activation block in gemm.clGian Marco Iodice
2022-11-02Partially Revert "Add threshold for floating-point SOFT_RELU activation"Gunes Bayir
2022-11-01Add threshold for floating-point SOFT_RELU activationMilos Puzovic
2022-11-01Rewrite dynamic fusionSiCong Li
2022-11-01Rework direct convolution heuristic on OpenCLGian Marco Iodice
2022-10-27Fix fixed-point quantized additionViet-Hoa Do
2022-10-24Add FP16 tanh based on rational approximationJonathan Deakin
2022-10-12Optimize Neon™ Logistic ActivationMohammed Suhail Munshi
2022-10-07Workaround CL compiler issue on FP16Viet-Hoa Do
2022-10-06Rework DepthwiseConvolution heuristic on OpenCLGian Marco Iodice
2022-10-06Improve start-up time in gemmlowp reshaped rhs only.Adnan AlSinan
2022-10-03Force CL kernel compilation with 64 registersViet-Hoa Do
2022-10-03Optimize CPU add layer on quantized dataViet-Hoa Do
2022-09-28Fix overflow in NEActivationLayer for FP16 typePablo Marquez Tello
2022-09-26Add FP32 Neon™ swish activationJonathan Deakin
2022-09-23CPU GEMM: Fix overreads in SVE merges.David Mansell
2022-09-16Optimize Quantized/Integer Bilinear Scale for Neon™Gunes Bayir
2022-09-14Interpreting tensor as 1D for CPU multiplicationViet-Hoa Do
2022-09-14Fix invalid memory access for dynamically fused Cl Elementwise kernelsSiCong Li
2022-09-14Adding GELU activationMurray Kornelsen
2022-09-14INT8 Quantized MeanStdDevNorm (LayerNorm)Murray Kornelsen
2022-09-09Optimize FP32/16 Bilinear Scale Kernel for Neon™Gunes Bayir
2022-09-09Add a macro guard in all OpenCL kernels in gemmlowp.clGian Marco Iodice
2022-09-07Optimize depthwise convolution on OpenCLGian Marco Iodice
2022-09-02F16 Specialization for MeanStdDevNormMurray Kornelsen
2022-09-02Enable Winograd-based conv2d when IFM>=8 on GpuGian Marco Iodice
2022-08-23Fix macos build errorsPablo Marquez Tello