ComputeLibrary.git

Age	Commit message	Author
2024-04-15	Add s8f32 kernels and dynamic QuantizationInfo	Jonathan Deakin
2024-04-12	Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...	Radu Salavat
2024-04-11	Add in place summation to CPU GEMM kernels	Radu Salavat
2024-04-04	Parallelise im2col along dimensions with higher number of iterations	Milos Puzovic
2024-03-21	[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...	Renato Arantes
2024-03-12	Optimize CpuSoftmaxKernel for axis != 0 and neon kernels	Omar Al Khatib
2024-03-11	Prefer indirect Gemm vs. Direct convolution if supported	Gunes Bayir
2024-03-04	Fix performance regression in fixed-format kernels	Gunes Bayir
2024-02-21	Integrate new pretranspose_b_array with extra fused transpose of B	Gunes Bayir
2024-02-20	Requantization cases for offset changes only	Mohammed Suhail Munshi
2024-02-12	Fix parallel depthwise perf regression from 2db938c	Jonathan Deakin
2024-02-07	Parallelize CPU depthwise over batch if only 1 row	Jonathan Deakin
2023-12-12	Winograd changes to enable fp16 in armv8a multi_isa builds	Pablo Marquez Tello
2023-12-06	Revert "thread_local _custom_scheduler"	Pablo Marquez Tello
2023-12-05	Optimize CpuSoftmaxKernel for axis=0	Gunes Bayir
2023-11-24	thread_local _custom_scheduler	David Svantesson
2023-11-15	Fix various coverity issues	SiCong Li
2023-11-10	Fix CpuGemmConv2d int8 segfault	SiCong Li
2023-11-08	Optimize CpuGemmConv2d start-up time	SiCong Li
2023-10-02	Optimize CL and Neon Winograd tests	Gunes Bayir
2023-09-28	Apply clang-format on repository	Felix Thomasmathibalan
2023-09-20	Fix the validation issue in AddMulAdd fused kernel	Gunes Bayir
2023-09-15	Remove deprecated support for BF16 in CpuCast	Adnan AlSinan
2023-09-15	Fix include dependencies for mass reformatting patch	Gunes Bayir
2023-09-04	Remove legacy PostOps code	Jakub Sujak
2023-08-23	Update CpuGemmConv2d and CpuFlatten to use CpuReshape operator	Anitha Raj
2023-08-22	Optimize CpuReshapeKernel	Anitha Raj
2023-08-17	Fix various static check issues	Viet-Hoa Do
2023-08-07	Document the Conv2D heuristic	Gian Marco Iodice
2023-07-28	Retain back-compatibility for arm_compute/core/Types.h	SiCong Li
2023-07-19	Add support for input S64/U64 in CpuCastKernel	Pablo Marquez Tello
2023-07-10	Do not include headers necessary for logging when logging is disabled	Matthew Bentham
2023-07-04	Depthwise channel pre-multiplication	Michael Tyler
2023-06-23	Address the issues with the ACL coverage pipeline failures related to matmul.	Renato Arantes
2023-06-16	Add Fused Activation to OpenCL MatMul	Mohammed Suhail Munshi
2023-06-15	Break up Utils.h a bit to reduce unused code being included everywhere	Matthew Bentham
2023-06-15	Break up arm_compute/core/Types.h a bit	Matthew Bentham
2023-05-10	Re-enable dynamic weights in Neon™ depthwise convolution	Ramy Elgammal
2023-05-05	Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int...	Jakub Sujak
2023-05-05	Disable dynamic weights in unsupported operators	Viet-Hoa Do
2023-05-03	Fix im2col for fast-maths mode with padding.	Renato Arantes
2023-05-03	Fix CPU MatMul broadcast detection	Viet-Hoa Do
2023-05-02	Fix fully connected and matmul mismatches	Viet-Hoa Do
2023-04-26	Integrate multi-threaded pretranspose_B_array	SiCong Li
2023-04-19	Add quantized support for CPU MatMul	Viet-Hoa Do
2023-04-14	Fix dynamic weights for CPU connected layer	Viet-Hoa Do
2023-04-13	Implement MatMul Function and Operator with Floating Point support for CPU	Mohammed Suhail Munshi
2023-03-21	Add dynamic weights for CPU fully connected layer	Viet-Hoa Do
2023-03-13	[ONCPUML-1174] Allow src/weights mismatch for fixed format	Jonathan Deakin
2023-03-03	NEGEMMLowpMatrixMultiplyCore should be configured for optimized int8 kernel.	Ethan Doe