aboutsummaryrefslogtreecommitdiff
AgeCommit message (Expand)Author
3 daysFix linking error to fp16_run_dequantization_core()branches/arm_compute_24_05Ramy Elgammal
3 daysUpdate logic in the OpenMP scheduler to exclude LITTLE coresOmar Al Khatib
4 daysRefactor Dequantize to enable FP16 kernel in v8a multi_isa buildsRamy Elgammal
5 daysFix nightly build errorPablo Marquez Tello
5 daysRework CpuQuantizeKernel to enable FP16 in multi_isa buildsRamy Elgammal
5 daysRefactor arm_gemm to enable FP16 in all multi_isa buildsPablo Marquez Tello
5 daysFix ReductionLayer FP16 for armv8a multi_isa buildsRamy Elgammal
7 daysScatterND fix for scalar casesGunes Bayir
12 daysAdd SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED.Omar Al Khatib
12 daysAdd batched indices support to Scatter GPU ImplementationMohammed Suhail Munshi
2024-05-03arm_gemm: fix SVE check on fast mode kernels.David Mansell
2024-05-02Change reorder implementation to be vector length agnostic for OHWIo8 reorderRadu Salavat
2024-05-01New SME2 heuristics.David Mansell
2024-04-30Add fp16 and integer data type support for ScatterNd in GpuGunes Bayir
2024-04-29Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...Gunes Bayir
2024-04-26Disable SME2 Gemm kernel selection in case results needs to be accumulatedGunes Bayir
2024-04-25Add update/index/output (m+1)/2d/(m+n) support for CLScatterGunes Bayir
2024-04-25Move s32 to f32 conversion in reference layers from quantization to dequantiz...Radu Salavat
2024-04-25Add memory stress tests for per channel quantized convolutionGunes Bayir
2024-04-25Add padding to the shift and multipliers buffersPablo Marquez Tello
2024-04-24Fix compiler error in the validation testsPablo Marquez Tello
2024-04-22Multi-Dimensional and Batched Scatter Reference and Dataset Implementation.Mohammed Suhail Munshi
2024-04-22Scatter GPU Kernel Implementation for 1D tensors.Mohammed Suhail Munshi
2024-04-17Update documentation for 24.04 releaseMichael Kozlov
2024-04-16Fix v7 test failure when core matmul result is dequantized into fp32Gunes Bayir
2024-04-16fix compilation errors on linux with gcc12Sunita Nadampalli
2024-04-15Add s8f32 kernels and dynamic QuantizationInfoJonathan Deakin
2024-04-15Add guarding for accumulation validation test in aarch32Radu Salavat
2024-04-12Runtime checks for bf16 fixed format testsDavid Svantesson-Yeung
2024-04-12Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...Radu Salavat
2024-04-11Add SME2 implementation of softmax for FP16Gunes Bayir
2024-04-11Add in place summation to CPU GEMM kernelsRadu Salavat
2024-04-09Specify absolute toleranceSangwon Ha
2024-04-05Fix compiler errorPablo Marquez Tello
2024-04-04Parallelise im2col along dimensions with higher number of iterationsMilos Puzovic
2024-04-02Add SME2 implementation of softmax for FP32Viet-Hoa Do
2024-03-27Added new NEON fixed format fast math mode hybrid kernel with maximum height ...Milos Puzovic
2024-03-26Fix graph examples for WoAPablo Marquez Tello
2024-03-25Adds Tests and reference implementation for scatter operator with 1D tensors.Mohammed Suhail Munshi
2024-03-22[ONCPUML-1451] Guard bf16 to bf16 tests with ARM_COMPUTE_ENABLE_FIXED_FORMAT_...Renato Arantes
2024-03-22Fix for nightly build failures for androidMohammed Suhail Munshi
2024-03-21Workaround to enable cross-compiling from macOS® to Android™Jakub Sujak
2024-03-21Add skeleton for CLScatter op, reference and testsMohammed Suhail Munshi
2024-03-21[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
2024-03-20Make Cpu/Gpu/Ref scalar/vectoral S32 division consistentGunes Bayir
2024-03-20Increase tolerance_num of Cpu RNNLayer testsGunes Bayir
2024-03-19Increase MatMul and DilatedConv test Q8 thresholds to 1Gunes Bayir
2024-03-19Fix overflow in NEMeanStdDevNormalizationKernelPablo Marquez Tello
2024-03-18Fix quant. gemv kernel driver by adding set_quantized_bias()Gunes Bayir
2024-03-14arm_gemm: Fix bias handling for sme2 FP16 GEMV.David Mansell