index : ComputeLibrary.git
path: root/src/gpu/cl/ClKernelLibrary.cpp
Age | Commit message | Author
2023-06-29 | Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extension | Gunes Bayir
2023-06-23 | Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension | Ramy Elgammal
2023-06-19 | Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extension | SiCong Li
2023-04-27 | Add quantized CL MatMul kernel for LHS NT, RHS T | Jakub Sujak
2023-04-20 | Implement CL kernel for a native batched matmul Quantized - LHS transposed, R... | Omar Al Khatib
2023-04-17 | Add quantized CL MatMul kernels for Lhs NT/T, Rhs NT | Gunes Bayir
2023-03-20 | Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16 | Gunes Bayir
2023-03-17 | Implementation of RSQRT for quantized int8 | Ramy Elgammal
2023-03-17 | Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16 | Ramy Elgammal
2022-12-13 | Add CLAMP operator to Dynamic Fusion interface | Jakub Sujak
2022-12-09 | Implement the OpenCL kernel to compute the indirect convolution | Gian Marco Iodice
2022-11-25 | Implement address precalculation for indirect conv2d - OpenCL | Gian Marco Iodice
2022-11-14 | Optimize Transposed Convolution for CL backend (FP32/16) | Gunes Bayir
2022-07-22 | Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED | Freddie Liardet
2022-07-13 | Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16 | Gunes Bayir
2022-04-19 | Add CLPool3d Int8 Support | Mohammed Suhail Munshi
2022-04-14 | Include missing embedded headers | SiCong Li
2022-03-31 | Fix embedded kernel header inclusion for dynamic fusion | Giorgio Arena
2022-03-15 | Implementation of ClPooling3d | ramelg01
2022-02-09 | Remove deprecated remap functions. | Adnan AlSinan
2022-02-02 | Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros" | Ramy Elgammal
2022-01-25 | Rework gemm_mm_reshaped_only_rhs_ kernels with new macros | Gian Marco Iodice
2021-12-13 | Remove padding from ClDirectConv2dKernel | Adnan AlSinan
2021-11-20 | Improve start-up timer for GeMM (floating-point): | ramelg01
2021-11-02 | Add post ops to ClGemmMatrixMultiplyReshapedOnlyRHSKernel and ClGemmMatrixMul... | SiCongLi
2021-10-28 | Add experimental PostOp interface to ClGemmMatrixMultiplyReshapedKernel Part 1 | SiCongLi
2021-10-18 | Remove legacy GeMM kernels on OpenCL | Gian Marco Iodice
2021-10-14 | Implement CLDirectConv3D f32/f16 | Giorgio Arena
2021-09-03 | Remove padding from ClPool2dKernel NCHW | Giorgio Arena
2021-08-25 | Move CPU/GPU files from Core/Runtime to the respective backend folders | Georgios Pinitas