index : ComputeLibrary.git
path: root/src/gpu/cl/kernels
Age | Commit message | Author
2023-04-17 | Add quantized CL MatMul kernels for Lhs NT/T, Rhs NT | Gunes Bayir
2023-04-14 | Align naming convention of ClMatMul | Jakub Sujak
2023-04-03 | Implement MatMul Function | Ramy Elgammal
2023-03-24 | Work around CLScale compiler-specific issue | SiCong Li
2023-03-24 | Add Texture Pipe Support for Matmul Lhs T/NT Rhs NT kernels | Gunes Bayir
2023-03-20 | Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16 | Gunes Bayir
2023-03-17 | Implementation of RSQRT for quantized int8 | Ramy Elgammal
2023-03-17 | Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16 | Ramy Elgammal
2023-03-06 | Fix LWS search space used by CLTuner | SiCong Li
2023-02-28 | Add an option to use lowest for max-pooling | Adnan AlSinan
2023-01-10 | Fix CL DirectConvolutionLayer validate tests | SiCong Li
2023-01-10 | Extend cl image support to input and output tensors | Gian Marco Iodice
2022-12-29 | Optimize CL Scale/Resize Quantized by removing (de)quant. code | Gunes Bayir
2022-12-29 | Extend Transposed Conv. for tiles with N0>1 | Gunes Bayir
2022-12-23 | Make CLReshape kernel window based on dst instead of src | Ramy Elgammal
2022-12-14 | Optimize Transposed Convolution for CL backend (Quantized) | Gunes Bayir
2022-12-09 | Use heuristics for setting dynamic fusion direct conv2d tile sizes | Ramy Elgammal
2022-12-09 | Implement the OpenCL kernel to compute the indirect convolution | Gian Marco Iodice
2022-11-25 | Implement address precalculation for indirect conv2d - OpenCL | Gian Marco Iodice
2022-11-22 | Remove dynamic fusion prototype with tests and examples | SiCong Li
2022-11-14 | Optimize Transposed Convolution for CL backend (FP32/16) | Gunes Bayir
2022-11-01 | Rework direct convolution heuristic on OpenCL | Gian Marco Iodice
2022-10-06 | Rework DepthwiseConvolution heuristic on OpenCL | Gian Marco Iodice
2022-10-06 | Improve start-up time in gemmlowp reshaped rhs only. | Adnan AlSinan
2022-10-04 | Update GEMM reshaped rhs only heuristic | Gian Marco Iodice
2022-10-03 | Force CL kernel compilation with 64 registers | Viet-Hoa Do
2022-09-16 | Fix validation in validate_image2d_support_on_rhs | Gian Marco Iodice
2022-09-09 | Add a macro guard in all OpenCL kernels in gemmlowp.cl | Gian Marco Iodice
2022-08-11 | Disable unsafe FP optimizations in Winograd Output Transform | Gunes Bayir
2022-07-22 | Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED | Freddie Liardet
2022-07-13 | Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16 | Gunes Bayir
2022-07-08 | Extended direct conv 2d interface for tuning the OpenCl kernel | Gian Marco Iodice
2022-06-28 | Fix OpenCL Winograd output transform | Gian Marco Iodice
2022-06-15 | Fix performance regression in Winograd Output Transform (OpenCL) | Gian Marco Iodice
2022-05-26 | Disable unsafe FP optimizations causing accuracy issues | Gunes Bayir
2022-05-11 | Fix inclusion guard for dynamic fusion module | SiCong Li
2022-05-06 | Integrate Dynamic Fusion patches | SiCong Li
2022-04-19 | Add CLPool3d Int8 Support | Mohammed Suhail Munshi
2022-04-14 | Enable dynamic cl tuning for dynamically fused kernels | SiCong Li
2022-04-13 | Add DirectConvolution2D kernel component for dynamic fusion | Gunes Bayir
2022-03-15 | Implementation of ClPooling3d | ramelg01
2022-03-08 | Merge kernel prototype patch | Giorgio Arena
2022-02-11 | Improve start-up time for concatenation layers | ramelg01
2022-02-10 | Improve start-up time for winograd_output_transform_*_nhwc | ramelg01
2022-02-09 | Improve start-up time for winograd_input_transform_*_nhwc | ramelg01
2022-02-08 | Improve start-up time for winograd_filter_transform_*_nhwc | ramelg01
2022-02-02 | Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros" | Ramy Elgammal
2022-01-25 | Rework gemm_mm_reshaped_only_rhs_ kernels with new macros | Gian Marco Iodice
2022-01-12 | Enabled support for QASYMM8 in ClCastKernel | Pablo Marquez Tello
2021-12-25 | Add tests for FP Cpu Pooling where pool region is completely outside the input | SiCongLi