index : ComputeLibrary.git
path: root/src/cpu/operators
Age
Commit message
Author
2024-04-15
Add s8f32 kernels and dynamic QuantizationInfo
Jonathan Deakin
2024-04-12
Accumulation in Cpu Gemm kernels is not supported for quantized kernels in aa...
Radu Salavat
2024-04-11
Add in place summation to CPU GEMM kernels
Radu Salavat
2024-04-04
Parallelise im2col along dimensions with higher number of iterations
Milos Puzovic
2024-03-21
[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...
Renato Arantes
2024-03-12
Optimize CpuSoftmaxKernel for axis != 0 and neon kernels
Omar Al Khatib
2024-03-11
Prefer indirect Gemm vs. Direct convolution if supported
Gunes Bayir
2024-03-04
Fix performance regression in fixed-format kernels
Gunes Bayir
2024-02-21
Integrate new pretranspose_b_array with extra fused transpose of B
Gunes Bayir
2024-02-20
Requantization cases for offset changes only
Mohammed Suhail Munshi
2024-02-12
Fix parallel depthwise perf regression from 2db938c
Jonathan Deakin
2024-02-07
Parallelize CPU depthwise over batch if only 1 row
Jonathan Deakin
2023-12-12
Winograd changes to enable fp16 in armv8a multi_isa builds
Pablo Marquez Tello
2023-12-06
Revert "thread_local _custom_scheduler"
Pablo Marquez Tello
2023-12-05
Optimize CpuSoftmaxKernel for axis=0
Gunes Bayir
2023-11-24
thread_local _custom_scheduler
David Svantesson
2023-11-15
Fix various coverity issues
SiCong Li
2023-11-10
Fix CpuGemmConv2d int8 segfault
SiCong Li
2023-11-08
Optimize CpuGemmConv2d start-up time
SiCong Li
2023-10-02
Optimize CL and Neon Winograd tests
Gunes Bayir
2023-09-28
Apply clang-format on repository
Felix Thomasmathibalan
2023-09-20
Fix the validation issue in AddMulAdd fused kernel
Gunes Bayir
2023-09-15
Remove deprecated support for BF16 in CpuCast
Adnan AlSinan
2023-09-15
Fix include dependencies for mass reformatting patch
Gunes Bayir
2023-09-04
Remove legacy PostOps code
Jakub Sujak
2023-08-23
Update CpuGemmConv2d and CpuFlatten to use CpuReshape operator
Anitha Raj
2023-08-22
Optimize CpuReshapeKernel
Anitha Raj
2023-08-17
Fix various static check issues
Viet-Hoa Do
2023-08-07
Document the Conv2D heuristic
Gian Marco Iodice
2023-07-28
Retain back-compatibility for arm_compute/core/Types.h
SiCong Li
2023-07-19
Add support for input S64/U64 in CpuCastKernel
Pablo Marquez Tello
2023-07-10
Do not include headers necessary for logging when logging is disabled
Matthew Bentham
2023-07-04
Depthwise channel pre-multiplication
Michael Tyler
2023-06-23
Address the issues with the ACL coverage pipeline failures related to matmul.
Renato Arantes
2023-06-16
Add Fused Activation to OpenCL MatMul
Mohammed Suhail Munshi
2023-06-15
Break up Utils.h a bit to reduce unused code being included everywhere
Matthew Bentham
2023-06-15
Break up arm_compute/core/Types.h a bit
Matthew Bentham
2023-05-10
Re-enable dynamic weights in Neon™ depthwise convolution
Ramy Elgammal
2023-05-05
Connect CLMatMul function to quantized kernels and resolve NE BatchMatMul int...
Jakub Sujak
2023-05-05
Disable dynamic weights in unsupported operators
Viet-Hoa Do
2023-05-03
Fix im2col for fast-maths mode with padding.
Renato Arantes
2023-05-03
Fix CPU MatMul broadcast detection
Viet-Hoa Do
2023-05-02
Fix fully connected and matmul mismatches
Viet-Hoa Do
2023-04-26
Integrate multi-threaded pretranspose_B_array
SiCong Li
2023-04-19
Add quantized support for CPU MatMul
Viet-Hoa Do
2023-04-14
Fix dynamic weights for CPU connected layer
Viet-Hoa Do
2023-04-13
Implement MatMul Function and Operator with Floating Point support for CPU
Mohammed Suhail Munshi
2023-03-21
Add dynamic weights for CPU fully connected layer
Viet-Hoa Do
2023-03-13
[ONCPUML-1174] Allow src/weights mismatch for fixed format
Jonathan Deakin
2023-03-03
NEGEMMLowpMatrixMultiplyCore should be configured for optimized int8 kernel.
Ethan Doe