index
:
ComputeLibrary.git
branches/arm_compute_19_02
branches/arm_compute_19_05
branches/arm_compute_19_08
branches/arm_compute_19_11
branches/arm_compute_20_02
branches/arm_compute_20_05
branches/arm_compute_20_08
branches/arm_compute_20_11
branches/arm_compute_21_02
branches/arm_compute_21_05
branches/arm_compute_21_08
branches/arm_compute_21_11
branches/arm_compute_22_02
branches/arm_compute_22_05
branches/arm_compute_22_08
branches/arm_compute_22_11
branches/arm_compute_23_02
branches/arm_compute_23_02_1
branches/arm_compute_23_05
branches/arm_compute_23_05_1
branches/arm_compute_23_08
branches/arm_compute_23_11
branches/arm_compute_24_01
branches/arm_compute_24_02
branches/arm_compute_24_02_1
branches/arm_compute_24_04
branches/arm_compute_24_05
branches/arm_compute_24_06
branches/arm_compute_24_07
branches/arm_compute_24_08
dev/21_02_int8_optim
dev/21_05_int8_optim
main
master
release_candidate
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
gpu
/
cl
Age
Commit message (
Expand
)
Author
2023-01-18
Add broadcast batched matmul validation cases
SiCong Li
2023-01-17
Fix ClGemm crashes on unsupported data types
SiCong Li
2023-01-10
Fix CL DirectConvolutionLayer validate tests
SiCong Li
2023-01-10
Extend cl image support to input and output tensors
Gian Marco Iodice
2022-12-29
Optimize CL Scale/Resize Quantized by removing (de)quant. code
Gunes Bayir
2022-12-29
Update the ClConv2d heuristic
Gian Marco Iodice
2022-12-29
Extend Transposed Conv. for tiles with N0>1
Gunes Bayir
2022-12-23
Make CLReshape kernel window based on dst instead of src
Ramy Elgammal
2022-12-14
Optimize Transposed Convolution for CL backend (Quantized)
Gunes Bayir
2022-12-13
Add CLAMP operator to Dynamic Fusion interface
Jakub Sujak
2022-12-12
Fix build error resulting from incorrect header path
Jakub Sujak
2022-12-09
Use heuristics for setting dynamic fusion direct conv2d tile sizes
Ramy Elgammal
2022-12-09
Implement the OpenCL kernel to compute the indirect convolution
Gian Marco Iodice
2022-11-25
Implement address precalculation for indirect conv2d - OpenCL
Gian Marco Iodice
2022-11-22
Remove dynamic fusion prototype with tests and examples
SiCong Li
2022-11-14
Optimize Transposed Convolution for CL backend (FP32/16)
Gunes Bayir
2022-11-01
Rework direct convolution heuristic on OpenCL
Gian Marco Iodice
2022-10-06
Rework DepthwiseConvolution heuristic on OpenCL
Gian Marco Iodice
2022-10-06
Improve start-up time in gemmlowp reshaped rhs only.
Adnan AlSinan
2022-10-04
Update GEMM reshaped rhs only heuristic
Gian Marco Iodice
2022-10-03
Force CL kernel compilation with 64 registers
Viet-Hoa Do
2022-09-16
Fix validation in validate_image2d_support_on_rhs
Gian Marco Iodice
2022-09-09
Rework heuristic in ClConv2d
Gian Marco Iodice
2022-09-09
Add a macro guard in all OpenCL kernels in gemmlowp.cl
Gian Marco Iodice
2022-09-02
Enable Winograd-based conv2d when IFM>=8 on Gpu
Gian Marco Iodice
2022-08-17
Revert "Fix performance regression in ClConv2D"
Ramy Elgammal
2022-08-16
Fix performance regression in ClConv2D
Gian Marco Iodice
2022-08-11
Fix performance regression in Conv2D on OpenCL
Adnan AlSinan
2022-08-11
Disable unsafe FP optimizations in Winograd Output Transform
Gunes Bayir
2022-08-05
Fix LeNet-f16 convolution regression
Adnan AlSinan
2022-07-22
Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED
Freddie Liardet
2022-07-22
Update ClConv2D heuristic to use direct convolution
Adnan AlSinan
2022-07-13
Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16
Gunes Bayir
2022-07-08
Extended direct conv 2d interface for tuning the OpenCl kernel
Gian Marco Iodice
2022-06-28
Fix OpenCL Winograd output transform
Gian Marco Iodice
2022-06-15
Fix performance regression in Winograd Output Transform (OpenCL)
Gian Marco Iodice
2022-05-26
Disable unsafe FP optimizations causing accuracy issues
Gunes Bayir
2022-05-11
Fix inclusion guard for dynamic fusion module
SiCong Li
2022-05-06
Integrate Dynamic Fusion patches
SiCong Li
2022-04-19
Add CLPool3d Int8 Support
Mohammed Suhail Munshi
2022-04-14
Enable dynamic cl tuning for dynamically fused kernels
SiCong Li
2022-04-14
Include missing embedded headers
SiCong Li
2022-04-13
Add DirectConvolution2D kernel component for dynamic fusion
Gunes Bayir
2022-03-31
Fix embedded kernel header inclusion for dynamic fusion
Giorgio Arena
2022-03-15
Implementation of ClPooling3d
ramelg01
2022-03-08
Merge kernel prototype patch
Giorgio Arena
2022-02-21
Fix performance regression on Arm(R) Mali(TM)-G71
Gian Marco Iodice
2022-02-11
Improve start-up time for concatenation layers
ramelg01
2022-02-10
Fix performance regression on the first layer of convolution-based model
Gian Marco Iodice
2022-02-10
Improve start-up time for winograd_output_transform_*_nhwc
ramelg01
[next]