aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
2 daysAdded new NEON fixed format fast math mode hybrid kernel with maximum height ...HEADrelease_candidatemainMilos Puzovic
3 daysFix graph examples for WoAPablo Marquez Tello
4 daysAdds Tests and reference implementation for scatter operator with 1D tensors.Mohammed Suhail Munshi
7 days[ONCPUML-1451] Guard bf16 to bf16 tests with ARM_COMPUTE_ENABLE_FIXED_FORMAT_...Renato Arantes
7 daysFix for nightly build failures for androidMohammed Suhail Munshi
8 daysWorkaround to enable cross-compiling from macOS® to Android™Jakub Sujak
8 daysAdd skeleton for CLScatter op, reference and testsMohammed Suhail Munshi
8 days[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
9 daysMake Cpu/Gpu/Ref scalar/vectoral S32 division consistentGunes Bayir
9 daysIncrease tolerance_num of Cpu RNNLayer testsGunes Bayir
10 daysIncrease MatMul and DilatedConv test Q8 thresholds to 1Gunes Bayir
10 daysFix overflow in NEMeanStdDevNormalizationKernelPablo Marquez Tello
11 daysFix quant. gemv kernel driver by adding set_quantized_bias()Gunes Bayir
2024-03-14arm_gemm: Fix bias handling for sme2 FP16 GEMV.David Mansell
2024-03-14Fix validation in pool2d assembly wrapperPablo Marquez Tello
2024-03-12Optimize CpuSoftmaxKernel for axis != 0 and neon kernelsOmar Al Khatib
2024-03-12Update documentation for 24.02.1 releaseFelix Thomasmathibalan
2024-03-12Fix WoA nightly failurePablo Marquez Tello
2024-03-11Prefer indirect Gemm vs. Direct convolution if supportedGunes Bayir
2024-03-11Set int8 test tolerance in FullyConnected to int8Gunes Bayir
2024-03-04Disable FP16 on 32 bitPablo Marquez Tello
2024-03-04Fix performance regression in fixed-format kernelsGunes Bayir
2024-03-01Set Neon™ as present for WoAPablo Marquez Tello
2024-02-22Fix segfault in DWC in WoAPablo Marquez Tello
2024-02-22Fix OpenBSD® build failure caused by patch 11144Gunes Bayir
2024-02-21Integrate new pretranspose_b_array with extra fused transpose of BGunes Bayir
2024-02-20Requantization cases for offset changes onlyMohammed Suhail Munshi
2024-02-15Fix linker errors in validation suite for WoAPablo Marquez Tello
2024-02-15Fix validation suite on WoAPablo Marquez Tello
2024-02-14Fix escape character issues in format_code scriptGunes Bayir
2024-02-14Fix compiler errors in cl-clangPablo Marquez Tello
2024-02-14[QTest] Use dynamic output quantization in Depthwise Conv testsOmar Al Khatib
2024-02-12Disable some DirectConv2d tests in Dynamic FusionGunes Bayir
2024-02-12Update documentation for 24.02 releaseFelix Thomasmathibalan
2024-02-12Fix parallel depthwise perf regression from 2db938cJonathan Deakin
2024-02-09Add support for QSYMM8 in ClCastKernelPablo Marquez Tello
2024-02-09Remove CKW prototype and Template WriterGunes Bayir
2024-02-08Fix the bug in GpuTanh operator in dynamic fusionGunes Bayir
2024-02-08Mark GpuSoftmax and GpuReshape as not supportedGunes Bayir
2024-02-07Parallelize CPU depthwise over batch if only 1 rowJonathan Deakin
2024-02-06arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...David Mansell
2024-02-06Build CKW by defaultGunes Bayir
2024-02-06Fix compilation issue in CKW due to unused variableGunes Bayir
2024-02-06Fix pathJakub Sujak
2024-02-06Disable FP16 tests compilation on Multi-Isa v8aMohammed Suhail Munshi
2024-02-05Fix leftover cols in CpuGemmLowpMatrixBReductionKernelJonathan Deakin
2024-02-02Add scripts to generate Doxygen documentationJakub Sujak
2024-02-01Use the stable CKW API in the GPU dynamic fusion backendGunes Bayir
2024-01-31Fix logic in SConscriptJakub Sujak
2024-01-31Add build options for Address and UndefinedBehavior sanitizersJakub Sujak