ComputeLibrary.git -

Age	Commit message (Expand)	Author
2024-03-27	Added new NEON fixed format fast math mode hybrid kernel with maximum height ...	Milos Puzovic
2024-03-25	Adds Tests and reference implementation for scatter operator with 1D tensors.	Mohammed Suhail Munshi
2024-03-21	Add skeleton for CLScatter op, reference and tests	Mohammed Suhail Munshi
2024-03-21	[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...	Renato Arantes
2024-03-20	Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent	Gunes Bayir
2024-03-19	Fix overflow in NEMeanStdDevNormalizationKernel	Pablo Marquez Tello
2024-03-18	Fix quant. gemv kernel driver by adding set_quantized_bias()	Gunes Bayir
2024-03-14	arm_gemm: Fix bias handling for sme2 FP16 GEMV.	David Mansell
2024-03-14	Fix validation in pool2d assembly wrapper	Pablo Marquez Tello
2024-03-12	Optimize CpuSoftmaxKernel for axis != 0 and neon kernels	Omar Al Khatib
2024-03-12	Fix WoA nightly failure	Pablo Marquez Tello
2024-03-11	Prefer indirect Gemm vs. Direct convolution if supported	Gunes Bayir
2024-03-04	Disable FP16 on 32 bit	Pablo Marquez Tello
2024-03-04	Fix performance regression in fixed-format kernels	Gunes Bayir
2024-03-01	Set Neon™ as present for WoA	Pablo Marquez Tello
2024-02-22	Fix segfault in DWC in WoA	Pablo Marquez Tello
2024-02-22	Fix OpenBSD® build failure caused by patch 11144	Gunes Bayir
2024-02-21	Integrate new pretranspose_b_array with extra fused transpose of B	Gunes Bayir
2024-02-20	Requantization cases for offset changes only	Mohammed Suhail Munshi
2024-02-14	Fix compiler errors in cl-clang	Pablo Marquez Tello
2024-02-12	Fix parallel depthwise perf regression from 2db938c	Jonathan Deakin
2024-02-09	Add support for QSYMM8 in ClCastKernel	Pablo Marquez Tello
2024-02-09	Remove CKW prototype and Template Writer	Gunes Bayir
2024-02-08	Fix the bug in GpuTanh operator in dynamic fusion	Gunes Bayir
2024-02-08	Mark GpuSoftmax and GpuReshape as not supported	Gunes Bayir
2024-02-07	Parallelize CPU depthwise over batch if only 1 row	Jonathan Deakin
2024-02-06	arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...	David Mansell
2024-02-05	Fix leftover cols in CpuGemmLowpMatrixBReductionKernel	Jonathan Deakin
2024-02-01	Use the stable CKW API in the GPU dynamic fusion backend	Gunes Bayir
2024-01-25	arm_gemm: convolution: optimize convolver.hpp.	David Mansell
2024-01-23	Fix for Logically dead code detected in Coverity checks	Anitha Raj
2024-01-23	Fix for unchecked return value detected in Coverity checks.	Anitha Raj
2024-01-23	Make GpuWorkloadContext own all tensor info objects	Viet-Hoa Do
2024-01-18	Fix divide-by-zero compilation error	Viet-Hoa Do
2024-01-17	Fix minor issue, clean lut code	Mohammed Suhail Munshi
2024-01-12	Fix potential threading issue in LUTManager	Mohammed Suhail Munshi
2024-01-12	[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.	Renato Arantes
2024-01-10	Fix compilation error on GCC 13.2	Jakub Sujak
2024-01-10	Use look up table for fp16 activation	Mohammed Suhail Munshi
2024-01-04	Prevent RELU from being processed thru LUT in INT8	Sangwon Ha
2023-12-22	Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel	Gunes Bayir
2023-12-22	Add Mali™-G720 and Mali™-G620 as GpuTargets	Gunes Bayir
2023-12-15	Fix nightly bug caused by not validation 3d cases for input tensor	Gunes Bayir
2023-12-15	Revert "Fix nightly bug caused by wrong validation in Gemm mmul kernel"	Gunes Bayir
2023-12-14	Fix validation error in CL generate proposals kernel	Gunes Bayir
2023-12-13	Fix nightly bug caused by wrong validation in Gemm mmul kernel	Gunes Bayir
2023-12-12	Winograd changes to enable fp16 in armv8a multi_isa builds	Pablo Marquez Tello
2023-12-08	Fix validation error in graph_ssd_mobilenet	Gunes Bayir
2023-12-08	Fix unit tests failing in CL/UNIT/TensorAllocator	Gunes Bayir
2023-12-07	Optimize CPU depth-to-space	Viet-Hoa Do