ComputeLibrary.git

Age	Commit message	Author
37 hours	Improve CPU extension detection on macos [HEAD, release_candidate, main]	Viet-Hoa Do
43 hours	ScatterND fix for scalar cases	Gunes Bayir
4 days	Make quantization rounding consistent	Jonathan Deakin
4 days	Add SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED. [branches/arm_compute_24_05]	Omar Al Khatib
4 days	Add batched indices support to Scatter GPU Implementation	Mohammed Suhail Munshi
9 days	arm_gemm: fix SVE check on fast mode kernels.	David Mansell
10 days	Change reorder implementation to be vector length agnostic for OHWIo8 reorder	Radu Salavat
11 days	New SME2 heuristics.	David Mansell
12 days	Add fp16 and integer data type support for ScatterNd in Gpu	Gunes Bayir
13 days	Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...	Gunes Bayir
2024-04-26	Disable SME2 Gemm kernel selection in case results needs to be accumulated	Gunes Bayir
2024-04-25	Add update/index/output (m+1)/2d/(m+n) support for CLScatter	Gunes Bayir
2024-04-25	Add padding to the shift and multipliers buffers	Pablo Marquez Tello
2024-04-22	Scatter GPU Kernel Implementation for 1D tensors.	Mohammed Suhail Munshi
2024-04-16	fix compilation errors on linux with gcc12	Sunita Nadampalli
2024-04-15	Add s8f32 kernels and dynamic QuantizationInfo	Jonathan Deakin
2024-04-12	Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...	Radu Salavat
2024-04-11	Add SME2 implementation of softmax for FP16	Gunes Bayir
2024-04-11	Add in place summation to CPU GEMM kernels	Radu Salavat
2024-04-05	Fix compiler error	Pablo Marquez Tello
2024-04-04	Parallelise im2col along dimensions with higher number of iterations	Milos Puzovic
2024-04-02	Add SME2 implementation of softmax for FP32	Viet-Hoa Do
2024-03-27	Added new NEON fixed format fast math mode hybrid kernel with maximum height ...	Milos Puzovic
2024-03-25	Adds Tests and reference implementation for scatter operator with 1D tensors.	Mohammed Suhail Munshi
2024-03-21	Add skeleton for CLScatter op, reference and tests	Mohammed Suhail Munshi
2024-03-21	[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...	Renato Arantes
2024-03-20	Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent	Gunes Bayir
2024-03-19	Fix overflow in NEMeanStdDevNormalizationKernel	Pablo Marquez Tello
2024-03-18	Fix quant. gemv kernel driver by adding set_quantized_bias()	Gunes Bayir
2024-03-14	arm_gemm: Fix bias handling for sme2 FP16 GEMV.	David Mansell
2024-03-14	Fix validation in pool2d assembly wrapper	Pablo Marquez Tello
2024-03-12	Optimize CpuSoftmaxKernel for axis != 0 and neon kernels	Omar Al Khatib
2024-03-12	Fix WoA nightly failure	Pablo Marquez Tello
2024-03-11	Prefer indirect Gemm vs. Direct convolution if supported	Gunes Bayir
2024-03-04	Disable FP16 on 32 bit	Pablo Marquez Tello
2024-03-04	Fix performance regression in fixed-format kernels	Gunes Bayir
2024-03-01	Set Neon™ as present for WoA	Pablo Marquez Tello
2024-02-22	Fix segfault in DWC in WoA	Pablo Marquez Tello
2024-02-22	Fix OpenBSD® build failure caused by patch 11144	Gunes Bayir
2024-02-21	Integrate new pretranspose_b_array with extra fused transpose of B	Gunes Bayir
2024-02-20	Requantization cases for offset changes only	Mohammed Suhail Munshi
2024-02-14	Fix compiler errors in cl-clang	Pablo Marquez Tello
2024-02-12	Fix parallel depthwise perf regression from 2db938c	Jonathan Deakin
2024-02-09	Add support for QSYMM8 in ClCastKernel	Pablo Marquez Tello
2024-02-09	Remove CKW prototype and Template Writer	Gunes Bayir
2024-02-08	Fix the bug in GpuTanh operator in dynamic fusion	Gunes Bayir
2024-02-08	Mark GpuSoftmax and GpuReshape as not supported	Gunes Bayir
2024-02-07	Parallelize CPU depthwise over batch if only 1 row	Jonathan Deakin
2024-02-06	arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...	David Mansell
2024-02-05	Fix leftover cols in CpuGemmLowpMatrixBReductionKernel	Jonathan Deakin