ComputeLibrary.git -

Age	Commit message (Expand)	Author
3 days	Update logic in the OpenMP scheduler to exclude LITTLE coresHEAD main	Omar Al Khatib
3 days	Fix linking error to fp16_run_dequantization_core()	Ramy Elgammal
4 days	Refactor Dequantize to enable FP16 kernel in v8a multi_isa builds	Ramy Elgammal
5 days	Fix nightly build error	Pablo Marquez Tello
5 days	Rework CpuQuantizeKernel to enable FP16 in multi_isa builds	Ramy Elgammal
6 days	Refactor arm_gemm to enable FP16 in all multi_isa builds	Pablo Marquez Tello
6 days	Fix ReductionLayer FP16 for armv8a multi_isa builds	Ramy Elgammal
9 days	Improve CPU extension detection on macos	Viet-Hoa Do
10 days	ScatterND fix for scalar cases	Gunes Bayir
11 days	Make quantization rounding consistent	Jonathan Deakin
12 days	Add SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED.	Omar Al Khatib
12 days	Add batched indices support to Scatter GPU Implementation	Mohammed Suhail Munshi
2024-05-03	arm_gemm: fix SVE check on fast mode kernels.	David Mansell
2024-05-02	Change reorder implementation to be vector length agnostic for OHWIo8 reorder	Radu Salavat
2024-05-01	New SME2 heuristics.	David Mansell
2024-04-30	Add fp16 and integer data type support for ScatterNd in Gpu	Gunes Bayir
2024-04-29	Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...	Gunes Bayir
2024-04-26	Disable SME2 Gemm kernel selection in case results needs to be accumulated	Gunes Bayir
2024-04-25	Add update/index/output (m+1)/2d/(m+n) support for CLScatter	Gunes Bayir
2024-04-25	Add padding to the shift and multipliers buffers	Pablo Marquez Tello
2024-04-22	Scatter GPU Kernel Implementation for 1D tensors.	Mohammed Suhail Munshi
2024-04-16	fix compilation errors on linux with gcc12	Sunita Nadampalli
2024-04-15	Add s8f32 kernels and dynamic QuantizationInfo	Jonathan Deakin
2024-04-12	Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...	Radu Salavat
2024-04-11	Add SME2 implementation of softmax for FP16	Gunes Bayir
2024-04-11	Add in place summation to CPU GEMM kernels	Radu Salavat
2024-04-05	Fix compiler error	Pablo Marquez Tello
2024-04-04	Parallelise im2col along dimensions with higher number of iterations	Milos Puzovic
2024-04-02	Add SME2 implementation of softmax for FP32	Viet-Hoa Do
2024-03-27	Added new NEON fixed format fast math mode hybrid kernel with maximum height ...	Milos Puzovic
2024-03-25	Adds Tests and reference implementation for scatter operator with 1D tensors.	Mohammed Suhail Munshi
2024-03-21	Add skeleton for CLScatter op, reference and tests	Mohammed Suhail Munshi
2024-03-21	[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...	Renato Arantes
2024-03-20	Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent	Gunes Bayir
2024-03-19	Fix overflow in NEMeanStdDevNormalizationKernel	Pablo Marquez Tello
2024-03-18	Fix quant. gemv kernel driver by adding set_quantized_bias()	Gunes Bayir
2024-03-14	arm_gemm: Fix bias handling for sme2 FP16 GEMV.	David Mansell
2024-03-14	Fix validation in pool2d assembly wrapper	Pablo Marquez Tello
2024-03-12	Optimize CpuSoftmaxKernel for axis != 0 and neon kernels	Omar Al Khatib
2024-03-12	Fix WoA nightly failure	Pablo Marquez Tello
2024-03-11	Prefer indirect Gemm vs. Direct convolution if supported	Gunes Bayir
2024-03-04	Disable FP16 on 32 bit	Pablo Marquez Tello
2024-03-04	Fix performance regression in fixed-format kernels	Gunes Bayir
2024-03-01	Set Neon™ as present for WoA	Pablo Marquez Tello
2024-02-22	Fix segfault in DWC in WoA	Pablo Marquez Tello
2024-02-22	Fix OpenBSD® build failure caused by patch 11144	Gunes Bayir
2024-02-21	Integrate new pretranspose_b_array with extra fused transpose of B	Gunes Bayir
2024-02-20	Requantization cases for offset changes only	Mohammed Suhail Munshi
2024-02-14	Fix compiler errors in cl-clang	Pablo Marquez Tello
2024-02-12	Fix parallel depthwise perf regression from 2db938c	Jonathan Deakin