index
:
ComputeLibrary.git
branches/arm_compute_19_02
branches/arm_compute_19_05
branches/arm_compute_19_08
branches/arm_compute_19_11
branches/arm_compute_20_02
branches/arm_compute_20_05
branches/arm_compute_20_08
branches/arm_compute_20_11
branches/arm_compute_21_02
branches/arm_compute_21_05
branches/arm_compute_21_08
branches/arm_compute_21_11
branches/arm_compute_22_02
branches/arm_compute_22_05
branches/arm_compute_22_08
branches/arm_compute_22_11
branches/arm_compute_23_02
branches/arm_compute_23_02_1
branches/arm_compute_23_05
branches/arm_compute_23_05_1
branches/arm_compute_23_08
branches/arm_compute_23_11
branches/arm_compute_24_01
branches/arm_compute_24_02
branches/arm_compute_24_02_1
branches/arm_compute_24_04
branches/arm_compute_24_05
dev/21_02_int8_optim
dev/21_05_int8_optim
main
master
release_candidate
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
Age
Commit message (
Expand
)
Author
37 hours
Improve CPU extension detection on macos
HEAD
release_candidate
main
Viet-Hoa Do
43 hours
ScatterND fix for scalar cases
Gunes Bayir
4 days
Make quantization rounding consistent
Jonathan Deakin
4 days
Add SME2 implementation of Softmax for QASYMM8 and QASYMM8_SIGNED.
branches/arm_compute_24_05
Omar Al Khatib
4 days
Add batched indices support to Scatter GPU Implementation
Mohammed Suhail Munshi
9 days
arm_gemm: fix SVE check on fast mode kernels.
David Mansell
10 days
Change reorder implementation to be vector length agnostic for OHWIo8 reorder
Radu Salavat
11 days
New SME2 heuristics.
David Mansell
12 days
Add fp16 and integer data type support for ScatterNd in Gpu
Gunes Bayir
13 days
Disable SME2 Gemmlowp s8f32 kernel selection in case results needs to be accu...
Gunes Bayir
2024-04-26
Disable SME2 Gemm kernel selection in case results needs to be accumulated
Gunes Bayir
2024-04-25
Add update/index/output (m+1)/2d/(m+n) support for CLScatter
Gunes Bayir
2024-04-25
Add padding to the shift and multipliers buffers
Pablo Marquez Tello
2024-04-22
Scatter GPU Kernel Implementation for 1D tensors.
Mohammed Suhail Munshi
2024-04-16
fix compilation errors on linux with gcc12
Sunita Nadampalli
2024-04-15
Add s8f32 kernels and dynamic QuantizationInfo
Jonathan Deakin
2024-04-12
Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...
Radu Salavat
2024-04-11
Add SME2 implementation of softmax for FP16
Gunes Bayir
2024-04-11
Add in place summation to CPU GEMM kernels
Radu Salavat
2024-04-05
Fix compiler error
Pablo Marquez Tello
2024-04-04
Parallelise im2col along dimensions with higher number of iterations
Milos Puzovic
2024-04-02
Add SME2 implementation of softmax for FP32
Viet-Hoa Do
2024-03-27
Added new NEON fixed format fast math mode hybrid kernel with maximum height ...
Milos Puzovic
2024-03-25
Adds Tests and reference implementation for scatter operator with 1D tensors.
Mohammed Suhail Munshi
2024-03-21
Add skeleton for CLScatter op, reference and tests
Mohammed Suhail Munshi
2024-03-21
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...
Renato Arantes
2024-03-20
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
Gunes Bayir
2024-03-19
Fix overflow in NEMeanStdDevNormalizationKernel
Pablo Marquez Tello
2024-03-18
Fix quant. gemv kernel driver by adding set_quantized_bias()
Gunes Bayir
2024-03-14
arm_gemm: Fix bias handling for sme2 FP16 GEMV.
David Mansell
2024-03-14
Fix validation in pool2d assembly wrapper
Pablo Marquez Tello
2024-03-12
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
Omar Al Khatib
2024-03-12
Fix WoA nightly failure
Pablo Marquez Tello
2024-03-11
Prefer indirect Gemm vs. Direct convolution if supported
Gunes Bayir
2024-03-04
Disable FP16 on 32 bit
Pablo Marquez Tello
2024-03-04
Fix performance regression in fixed-format kernels
Gunes Bayir
2024-03-01
Set Neon™ as present for WoA
Pablo Marquez Tello
2024-02-22
Fix segfault in DWC in WoA
Pablo Marquez Tello
2024-02-22
Fix OpenBSD® build failure caused by patch 11144
Gunes Bayir
2024-02-21
Integrate new pretranspose_b_array with extra fused transpose of B
Gunes Bayir
2024-02-20
Requantization cases for offset changes only
Mohammed Suhail Munshi
2024-02-14
Fix compiler errors in cl-clang
Pablo Marquez Tello
2024-02-12
Fix parallel depthwise perf regression from 2db938c
Jonathan Deakin
2024-02-09
Add support for QSYMM8 in ClCastKernel
Pablo Marquez Tello
2024-02-09
Remove CKW prototype and Template Writer
Gunes Bayir
2024-02-08
Fix the bug in GpuTanh operator in dynamic fusion
Gunes Bayir
2024-02-08
Mark GpuSoftmax and GpuReshape as not supported
Gunes Bayir
2024-02-07
Parallelize CPU depthwise over batch if only 1 row
Jonathan Deakin
2024-02-06
arm_gemm: SME: Remove artificial single-thread constraint on quantized int8 k...
David Mansell
2024-02-05
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
Jonathan Deakin
[next]