aboutsummaryrefslogtreecommitdiff
path: root/src/core
AgeCommit message (Expand)Author
5 hoursDisable SME2 Gemm kernel selection in case results needs to be accumulatedHEADmainGunes Bayir
26 hoursAdd update/index/output (m+1)/2d/(m+n) support for CLScatterGunes Bayir
31 hoursAdd padding to the shift and multipliers buffersPablo Marquez Tello
4 daysScatter GPU Kernel Implementation for 1D tensors.Mohammed Suhail Munshi
10 daysfix compilation errors on linux with gcc12Sunita Nadampalli
11 daysAdd s8f32 kernels and dynamic QuantizationInfoJonathan Deakin
2024-04-11Add SME2 implementation of softmax for FP16Gunes Bayir
2024-04-11Add in place summation to CPU GEMM kernelsRadu Salavat
2024-04-05Fix compiler errorPablo Marquez Tello
2024-03-27Added new NEON fixed format fast math mode hybrid kernel with maximum height ...Milos Puzovic
2024-03-21[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
2024-03-20Make Cpu/Gpu/Ref scalar/vectoral S32 division consistentGunes Bayir
2024-03-18Fix quant. gemv kernel driver by adding set_quantized_bias()Gunes Bayir
2024-03-14arm_gemm: Fix bias handling for sme2 FP16 GEMV.David Mansell
2024-03-12Fix WoA nightly failurePablo Marquez Tello
2024-02-22Fix segfault in DWC in WoAPablo Marquez Tello
2024-02-22Fix OpenBSD® build failure caused by patch 11144Gunes Bayir
2024-02-21Integrate new pretranspose_b_array with extra fused transpose of BGunes Bayir
2024-02-14Fix compiler errors in cl-clangPablo Marquez Tello
2024-02-07Parallelize CPU depthwise over batch if only 1 rowJonathan Deakin
2024-02-06arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...David Mansell
2024-01-25arm_gemm: convolution: optimize convolver.hpp.David Mansell
2024-01-23Fix for unchecked return value detected in Coverity checks.Anitha Raj
2024-01-18Fix divide-by-zero compilation errorViet-Hoa Do
2024-01-17Fix minor issue, clean lut codeMohammed Suhail Munshi
2024-01-12Fix potential threading issue in LUTManagerMohammed Suhail Munshi
2024-01-12[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.Renato Arantes
2024-01-10Use look up table for fp16 activationMohammed Suhail Munshi
2023-12-22Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernelGunes Bayir
2023-12-22Add Mali™-G720 and Mali™-G620 as GpuTargetsGunes Bayir
2023-12-14Fix validation error in CL generate proposals kernelGunes Bayir
2023-12-08Fix validation error in graph_ssd_mobilenetGunes Bayir
2023-12-08Fix unit tests failing in CL/UNIT/TensorAllocatorGunes Bayir
2023-12-07Optimize CPU depth-to-spaceViet-Hoa Do
2023-12-05Optimize CpuSoftmaxKernel for axis=0Gunes Bayir
2023-11-28Changes to enable FP16 in armv8a multi_isaPablo Marquez Tello
2023-11-27BatchNorm changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-16NormalizationLayer changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-15Fix various coverity issuesSiCong Li
2023-11-15Fix device issue with CL softmaxViet-Hoa Do
2023-11-09Remove duplicate definitions of BF16 fixed format kernels.David Mansell
2023-11-08Optimize CpuGemmConv2d start-up timeSiCong Li
2023-11-01Fix compilation error with clang and multi-isaViet-Hoa Do
2023-10-31[GPU] Update Reverse layer to allow negative axis and reversed axis orderAdnan AlSinan
2023-10-31Fix SVE kernel using SVE2 instructionViet-Hoa Do
2023-10-31Optimize CL softmaxViet-Hoa Do
2023-10-17arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.David Mansell
2023-10-17Revert "arm_gemm: Add SME2 FP16 GEMV."David Mansell
2023-10-12arm_gemm: Add SME2 FP16 GEMV.David Mansell
2023-10-12Remove padding from CL comparison operatorViet-Hoa Do