index
:
ComputeLibrary.git
branches/arm_compute_19_02
branches/arm_compute_19_05
branches/arm_compute_19_08
branches/arm_compute_19_11
branches/arm_compute_20_02
branches/arm_compute_20_05
branches/arm_compute_20_08
branches/arm_compute_20_11
branches/arm_compute_21_02
branches/arm_compute_21_05
branches/arm_compute_21_08
branches/arm_compute_21_11
branches/arm_compute_22_02
branches/arm_compute_22_05
branches/arm_compute_22_08
branches/arm_compute_22_11
branches/arm_compute_23_02
branches/arm_compute_23_02_1
branches/arm_compute_23_05
branches/arm_compute_23_05_1
branches/arm_compute_23_08
branches/arm_compute_23_11
branches/arm_compute_24_01
branches/arm_compute_24_02
branches/arm_compute_24_02_1
branches/arm_compute_24_04
branches/arm_compute_24_05
branches/arm_compute_24_06
dev/21_02_int8_optim
dev/21_05_int8_optim
main
master
release_candidate
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
2024-04-16
fix compilation errors on linux with gcc12
Sunita Nadampalli
2024-04-15
Add s8f32 kernels and dynamic QuantizationInfo
Jonathan Deakin
2024-04-12
Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...
Radu Salavat
2024-04-11
Add SME2 implementation of softmax for FP16
Gunes Bayir
2024-04-11
Add in place summation to CPU GEMM kernels
Radu Salavat
2024-04-05
Fix compiler error
Pablo Marquez Tello
2024-04-04
Parallelise im2col along dimensions with higher number of iterations
Milos Puzovic
2024-04-02
Add SME2 implementation of softmax for FP32
Viet-Hoa Do
2024-03-27
Added new NEON fixed format fast math mode hybrid kernel with maximum height ...
Milos Puzovic
2024-03-25
Adds Tests and reference implementation for scatter operator with 1D tensors.
Mohammed Suhail Munshi
2024-03-21
Add skeleton for CLScatter op, reference and tests
Mohammed Suhail Munshi
2024-03-21
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...
Renato Arantes
2024-03-20
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
Gunes Bayir
2024-03-19
Fix overflow in NEMeanStdDevNormalizationKernel
Pablo Marquez Tello
2024-03-18
Fix quant. gemv kernel driver by adding set_quantized_bias()
Gunes Bayir
2024-03-14
arm_gemm: Fix bias handling for sme2 FP16 GEMV.
David Mansell
2024-03-14
Fix validation in pool2d assembly wrapper
Pablo Marquez Tello
2024-03-12
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
Omar Al Khatib
2024-03-12
Fix WoA nightly failure
Pablo Marquez Tello
2024-03-11
Prefer indirect Gemm vs. Direct convolution if supported
Gunes Bayir
2024-03-04
Disable FP16 on 32 bit
Pablo Marquez Tello
2024-03-04
Fix performance regression in fixed-format kernels
Gunes Bayir
2024-03-01
Set Neon™ as present for WoA
Pablo Marquez Tello
2024-02-22
Fix segfault in DWC in WoA
Pablo Marquez Tello
2024-02-22
Fix OpenBSD® build failure caused by patch 11144
Gunes Bayir
2024-02-21
Integrate new pretranspose_b_array with extra fused transpose of B
Gunes Bayir
2024-02-20
Requantization cases for offset changes only
Mohammed Suhail Munshi
2024-02-14
Fix compiler errors in cl-clang
Pablo Marquez Tello
2024-02-12
Fix parallel depthwise perf regression from 2db938c
Jonathan Deakin
2024-02-09
Add support for QSYMM8 in ClCastKernel
Pablo Marquez Tello
2024-02-09
Remove CKW prototype and Template Writer
Gunes Bayir
2024-02-08
Fix the bug in GpuTanh operator in dynamic fusion
Gunes Bayir
2024-02-08
Mark GpuSoftmax and GpuReshape as not supported
Gunes Bayir
2024-02-07
Parallelize CPU depthwise over batch if only 1 row
Jonathan Deakin
2024-02-06
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...
David Mansell
2024-02-05
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
Jonathan Deakin
2024-02-01
Use the stable CKW API in the GPU dynamic fusion backend
Gunes Bayir
2024-01-25
arm_gemm: convolution: optimize convolver.hpp.
David Mansell
2024-01-23
Fix for Logically dead code detected in Coverity checks
Anitha Raj
2024-01-23
Fix for unchecked return value detected in Coverity checks.
Anitha Raj
2024-01-23
Make GpuWorkloadContext own all tensor info objects
Viet-Hoa Do
2024-01-18
Fix divide-by-zero compilation error
Viet-Hoa Do
2024-01-17
Fix minor issue, clean lut code
Mohammed Suhail Munshi
2024-01-12
Fix potential threading issue in LUTManager
Mohammed Suhail Munshi
2024-01-12
[ONCPUML-1387] Add ACL based reorder for f32 to bf16 data type conversion.
Renato Arantes
2024-01-10
Fix compilation error on GCC 13.2
Jakub Sujak
2024-01-10
Use look up table for fp16 activation
Mohammed Suhail Munshi
2024-01-04
Prevent RELU from being processed thru LUT in INT8
Sangwon Ha
2023-12-22
Fix nightly issue caused by gemm_reshaped_only_rhs_mmul kernel
Gunes Bayir
2023-12-22
Add Mali™-G720 and Mali™-G620 as GpuTargets
Gunes Bayir
[next]