ComputeLibrary.git
path: root/src/cpu
Age
Commit message
Author
2024-04-15
Add s8f32 kernels and dynamic QuantizationInfo
Jonathan Deakin
2024-04-12
Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...
Radu Salavat
2024-04-11
Add SME2 implementation of softmax for FP16
Gunes Bayir
2024-04-11
Add in place summation to CPU GEMM kernels
Radu Salavat
2024-04-04
Parallelise im2col along dimensions with higher number of iterations
Milos Puzovic
2024-04-02
Add SME2 implementation of softmax for FP32
Viet-Hoa Do
2024-03-21
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...
Renato Arantes
2024-03-20
Make Cpu/Gpu/Ref scalar/vectoral S32 division consistent
Gunes Bayir
2024-03-19
Fix overflow in NEMeanStdDevNormalizationKernel
Pablo Marquez Tello
2024-03-14
Fix validation in pool2d assembly wrapper
Pablo Marquez Tello
2024-03-12
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
Omar Al Khatib
2024-03-11
Prefer indirect Gemm vs. Direct convolution if supported
Gunes Bayir
2024-03-04
Fix performance regression in fixed-format kernels
Gunes Bayir
2024-02-21
Integrate new pretranspose_b_array with extra fused transpose of B
Gunes Bayir
2024-02-20
Requantization cases for offset changes only
Mohammed Suhail Munshi
2024-02-12
Fix parallel depthwise perf regression from 2db938c
Jonathan Deakin
2024-02-07
Parallelize CPU depthwise over batch if only 1 row
Jonathan Deakin
2024-02-05
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel
Jonathan Deakin
2024-01-23
Fix for Logically dead code detected in Coverity checks
Anitha Raj
2024-01-10
Use look up table for fp16 activation
Mohammed Suhail Munshi
2024-01-04
Prevent RELU from being processed thru LUT in INT8
Sangwon Ha
2023-12-12
Winograd changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-12-07
Optimize CPU depth-to-space
Viet-Hoa Do
2023-12-06
Revert "thread_local _custom_scheduler"
Pablo Marquez Tello
2023-12-05
Optimize CpuSoftmaxKernel for axis=0
Gunes Bayir
2023-11-27
BatchNorm changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-11-27
CpuMul changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-11-24
thread_local _custom_scheduler
David Svantesson
2023-11-16
NormalizationLayer changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-11-15
Fix various coverity issues
SiCong Li
2023-11-10
Fix CpuGemmConv2d int8 segfault
SiCong Li
2023-11-09
Pooling changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-11-09
DepthwiseConvolution changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-11-08
Optimize CpuGemmConv2d start-up time
SiCong Li
2023-10-30
DirectConv and Im2Col changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-10-20
FuseBatchNorm changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-10-13
Fix build error in CpuScale
Pablo Marquez Tello
2023-10-12
Scale changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-10-10
Fix build error
Pablo Marquez Tello
2023-10-10
CpuSubKernel changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-10-09
Pool2d changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-10-05
Optimize CLTranspose operator
Jakub Sujak
2023-10-02
Optimize CL and Neon Winograd tests
Gunes Bayir
2023-09-28
Apply clang-format on repository
Felix Thomasmathibalan
2023-09-26
Re-arrange header inclusion order
Felix Thomasmathibalan
2023-09-26
Select changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-09-26
Maxunpooling changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-09-21
L2Norm changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-09-21
Gemm changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-09-20
Fix the validation issue in AddMulAdd fused kernel
Gunes Bayir