aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2024-02-01Use the stable CKW API in the GPU dynamic fusion backendGunes Bayir
2024-01-25arm_gemm: convolution: optimize convolver.hpp.David Mansell
2024-01-23Fix for Logically dead code detected in Coverity checksAnitha Raj
2024-01-23Fix for unchecked return value detected in Coverity checks.Anitha Raj
2024-01-23Make GpuWorkloadContext own all tensor info objectsViet-Hoa Do
2024-01-18Fix divide-by-zero compilation errorViet-Hoa Do
2024-01-17Fix minor issue, clean lut codeMohammed Suhail Munshi
2024-01-12Fix potential threading issue in LUTManagerMohammed Suhail Munshi
2024-01-12[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.Renato Arantes
2024-01-10Fix compilation error on GCC 13.2Jakub Sujak
2024-01-10Use look up table for fp16 activationMohammed Suhail Munshi
2024-01-04Prevent RELU from being processed thru LUT in INT8Sangwon Ha
2023-12-22Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernelGunes Bayir
2023-12-22Add Mali™-G720 and Mali™-G620 as GpuTargetsGunes Bayir
2023-12-15Fix nightly bug caused by not validation 3d cases for input tensorGunes Bayir
2023-12-15Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel"Gunes Bayir
2023-12-14Fix validation error in CL generate proposals kernelGunes Bayir
2023-12-13Fix nightly bug caused by wrong validation in Gemm mmul kernelGunes Bayir
2023-12-12Winograd changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-12-08Fix validation error in graph_ssd_mobilenetGunes Bayir
2023-12-08Fix unit tests failing in CL/UNIT/TensorAllocatorGunes Bayir
2023-12-07Optimize CPU depth-to-spaceViet-Hoa Do
2023-12-06Revert "thread_local _custom_scheduler"Pablo Marquez Tello
2023-12-05Optimize CpuSoftmaxKernel for axis=0Gunes Bayir
2023-11-28Changes to enable FP16 in armv8a multi_isaPablo Marquez Tello
2023-11-27BatchNorm changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-27CpuMul changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-24thread_local _custom_schedulerDavid Svantesson
2023-11-16NormalizationLayer changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-15Fix various coverity issuesSiCong Li
2023-11-15Fix device issue with CL softmaxViet-Hoa Do
2023-11-14Update comments to suppress doxygen warnings.Anitha Raj
2023-11-10Fix CpuGemmConv2d int8 segfaultSiCong Li
2023-11-09Remove duplicate definitions of BF16 fixed format kernels.David Mansell
2023-11-09Pooling changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-09DepthwiseConvolution changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-11-08Optimize CpuGemmConv2d start-up timeSiCong Li
2023-11-07Update heuristic for MatMul Native U8Gian Marco Iodice
2023-11-01Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82Viet-Hoa Do
2023-11-01Fix compilation error with clang and multi-isaViet-Hoa Do
2023-10-31[GPU] Update Reverse layer to allow negative axis and reversed axis orderAdnan AlSinan
2023-10-31Extend CKW MatMul with nt_tAdnan AlSinan
2023-10-31Fix SVE kernel using SVE2 instructionViet-Hoa Do
2023-10-31Optimize CL softmaxViet-Hoa Do
2023-10-30DirectConv and Im2Col changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-10-26Add check to disable dynamic bias with quantized datatypes in Conv2D layerMohammed Suhail Munshi
2023-10-20FuseBatchNorm changes to enable fp16 in armv8a multi_isa buildsPablo Marquez Tello
2023-10-17arm_gemm: Add SME2 FP16 GEMV using FP16->FP32 dot product.David Mansell
2023-10-17Revert "arm_gemm: Add SME2 FP16 GEMV."David Mansell
2023-10-13Connect MatMul MMUL kernels to ClMatMul operatorGunes Bayir