aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2022-12-21Optimize MeanReduce by integer acc. and removing upfront dequant.Omar Al Khatib
2022-12-21Update direct conv2d kernel in dynamic fusionGian Marco Iodice
2022-12-21Optimize SVE natural exponential functionViet-Hoa Do
2022-12-21Fixed various mismatches in CpuCastKernelPablo Marquez Tello
2022-12-16Add output operator for dynamic fusionViet-Hoa Do
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-13Add CLAMP operator to Dynamic Fusion interfaceJakub Sujak
2022-12-12Fix build error resulting from incorrect header pathJakub Sujak
2022-12-09Implement Cast operator in dynamic fusionGunes Bayir
2022-12-09Use heuristics for setting dynamic fusion direct conv2d tile sizesRamy Elgammal
2022-12-09Optimize CPU base-e exponential function on FP32Viet-Hoa Do
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-30Fix build error for unused variables in data type specific buildsGunes Bayir
2022-11-29Adding GpuAdd to dynamic fusion operatorsRamy Elgammal
2022-11-28Integrate SME2 kernelsViet-Hoa Do
2022-11-28Implement FP32/16 Depthwise Conv2d operator in dynamic fusionGunes Bayir
2022-11-25Implement address precalculation for indirect conv2d - OpenCLGian Marco Iodice
2022-11-23ONCPUML-1072: Remove double definition of get_mws for Mul kernelfadara01
2022-11-22Remove dynamic fusion prototype with tests and examplesSiCong Li
2022-11-22ONCPUML-1072: Tuned MWS values (for N1, V1) for binary operators used by oneDNNFadi Arafeh
2022-11-18Add num_threads_to_use to OMPScheduler based on workload sizecfRod
2022-11-15Fix regression caused by mws in ActivationLayerMohammed Suhail Munshi
2022-11-15Fixed Arm NN unit test failure caused by quantised multiplication patch.Omar Al Khatib
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-11-14Optimize T_QUANTIZE8_ASYMMETRIC for Mali™ G52Pablo Marquez Tello
2022-11-10Fix compiler warnings in dynamic fusionSiCong Li
2022-11-09Fix CPU multiplication layer threading overheadViet-Hoa Do
2022-11-08SVE Hard-Swish via Lookup table for quantized inputPablo Marquez Tello
2022-11-07Optimize CPU mul layer on quantized dataOmar Al Khatib
2022-11-04Fix compiler warnings in dynamic fusionSiCong Li
2022-11-03Fix activation block in gemm.clGian Marco Iodice
2022-11-02Partially Revert "Add threshold for floating-point SOFT_RELU activation"Gunes Bayir
2022-11-01Fix fixed-point quantized additionViet-Hoa Do
2022-11-01Updateable weights in depthwise convolutionMilos Puzovic
2022-11-01Add threshold for floating-point SOFT_RELU activationMilos Puzovic
2022-11-01Add check for Batch Matmul in GemmAssemblyDispatchMohammed Suhail Munshi
2022-11-01Rewrite dynamic fusionSiCong Li
2022-11-01Rework direct convolution heuristic on OpenCLGian Marco Iodice
2022-10-27Fix fixed-point quantized additionViet-Hoa Do
2022-10-24Add FP16 tanh based on rational approximationJonathan Deakin
2022-10-20Update reinterpret tensor as 1D for CPU addViet-Hoa Do
2022-10-20Add test in GEMMLowp for batch matmulMohammed Suhail Munshi
2022-10-19Fix FFTConvolutionLayer testViet-Hoa Do
2022-10-12Optimize Neon™ Logistic ActivationMohammed Suhail Munshi
2022-10-12Adding documentation section explaining how BF16 is usedRamy Elgammal
2022-10-10Fix LUT-based activation layerViet-Hoa Do
2022-10-07Workaround CL compiler issue on FP16Viet-Hoa Do
2022-10-07Optimize Neon™ SUB operator by squashing execution windowJakub Sujak
2022-10-06Rework DepthwiseConvolution heuristic on OpenCLGian Marco Iodice
2022-10-06Improve start-up time in gemmlowp reshaped rhs only.Adnan AlSinan