index : ComputeLibrary.git
path: root/src/core/CL/cl_kernels
Age
Commit message
Author
2022-07-22
Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED
Freddie Liardet
2022-07-21
Fix direct convolution cases that were failing on Odroid
Adnan AlSinan
2022-07-13
Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16
Gunes Bayir
2022-07-05
Add G57 to GPUTarget
SiCong Li
2022-06-27
Implement new Elementwise Dynamic Fusion Operators: Div, Floor
Michalis Spyrou
2022-06-15
Fix performance regression in Winograd Output Transform (OpenCL)
Gian Marco Iodice
2022-05-31
Add cl_khr_integer_dot_product extension support
Viet-Hoa Do
2022-05-09
Mismatches in dynamically fused direct conv2d + add kernel
Michalis Spyrou
2022-04-19
Add CLPool3d Int8 Support
Mohammed Suhail Munshi
2022-04-14
Include missing embedded headers
SiCong Li
2022-03-15
Implementation of ClPooling3d
ramelg01
2022-03-08
Merge kernel prototype patch
Giorgio Arena
2022-02-11
Improve start-up time for concatenation layers
ramelg01
2022-02-10
Improve start-up time for winograd_output_transform_*_nhwc
ramelg01
2022-02-09
Remove deprecated remap functions.
Adnan AlSinan
2022-02-09
Improve start-up time for winograd_input_transform_*_nhwc
ramelg01
2022-02-08
Improve start-up time for winograd_filter_transform_*_nhwc
ramelg01
2022-02-02
Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"
Ramy Elgammal
2022-01-25
Rework gemm_mm_reshaped_only_rhs_ kernels with new macros
Gian Marco Iodice
2021-12-23
Rework gemm_reshape_lhs_ with new macros
Adnan AlSinan
2021-12-13
Remove padding from ClDirectConv2dKernel
Adnan AlSinan
2021-12-10
Use #if directive instead of regular condition in CLDirectConv2D
Giorgio Arena
2021-12-01
Improve start-up direct convolution on OpenCL
Gian Marco Iodice
2021-11-29
Use loop unrolling only when the kernel height is less than 5
Gian Marco Iodice
2021-11-26
Rework gemm_reshape_rhs_(nt,t) with new macros
Gian Marco Iodice
2021-11-20
Improve start-up timer for GeMM (floating-point):
ramelg01
2021-11-17
Improve start-up timer for ClIm2Col
Giorgio Arena
2021-11-17
Improve start-up time for depthwise convolution
Sheri Zhang
2021-11-09
Improve start-up time for ClScale
Adnan AlSinan
2021-11-04
Add validate tests for CLConvolutionLayer and CLGEMMConvolutionLayer with pos...
SiCongLi
2021-11-04
Add PRelu to supported PostOps in:
ramelg01
2021-11-03
Fix out-of-bound reads in cl gemm kernels
SiCongLi
2021-11-02
Add post ops to ClGemmMatrixMultiplyReshapedOnlyRHSKernel and ClGemmMatrixMul...
SiCongLi
2021-11-01
Remove padding in FP Cl Gemm kernels
SiCongLi
2021-10-28
Add experimental PostOp interface to ClGemmMatrixMultiplyReshapedKernel Part 1
SiCongLi
2021-10-20
Implement CLDirectConv3DKernel - uint8/int8
Giorgio Arena
2021-10-18
Remove legacy GeMM kernels on OpenCL
Gian Marco Iodice
2021-10-18
Fix precision issue in ChannelShuffleKernel
Pablo Marquez Tello
2021-10-15
Fix CLConv3D filelist and comments
Giorgio Arena
2021-10-14
Implement CLDirectConv3D f32/f16
Giorgio Arena
2021-10-13
Improve performance of Softmax uint8 on GPU
Adnan AlSinan
2021-09-23
Fix inefficient store in gemmlowp_mm_reshaped_only_rhs_t
Gian Marco Iodice
2021-09-14
Optimize ClScaleKernel on NHWC (f32/f16/int8)
Gian Marco Iodice
2021-09-09
Remove padding from ClGemmMatrixMultiplyReshapedOnlyRhsKernel
Giorgio Arena
2021-09-08
Fix vload_partial macros on OpenCL
Giorgio Arena
2021-09-07
Remove padding from ClGemmMatrixMultiplyReshapedKernel
Giorgio Arena
2021-09-06
Revert "Remove padding from ClGemmMatrixMultiplyReshapedKernel"
Pablo Marquez Tello
2021-09-03
Remove padding from ClPool2dKernel NCHW
Giorgio Arena
2021-09-03
Fix CLNormalizationLayer NCHW border calculation
SiCongLi
2021-09-01
Remove padding from ClGemmMatrixMultiplyReshapedKernel
Michele Di Giorgio