index
:
ComputeLibrary.git
branches/arm_compute_19_02
branches/arm_compute_19_05
branches/arm_compute_19_08
branches/arm_compute_19_11
branches/arm_compute_20_02
branches/arm_compute_20_05
branches/arm_compute_20_08
branches/arm_compute_20_11
branches/arm_compute_21_02
branches/arm_compute_21_05
branches/arm_compute_21_08
branches/arm_compute_21_11
branches/arm_compute_22_02
branches/arm_compute_22_05
branches/arm_compute_22_08
branches/arm_compute_22_11
branches/arm_compute_23_02
branches/arm_compute_23_02_1
branches/arm_compute_23_05
branches/arm_compute_23_05_1
branches/arm_compute_23_08
branches/arm_compute_23_11
branches/arm_compute_24_01
branches/arm_compute_24_02
branches/arm_compute_24_02_1
branches/arm_compute_24_04
branches/arm_compute_24_05
branches/arm_compute_24_06
branches/arm_compute_24_07
branches/arm_compute_24_08
branches/arm_compute_24_08_1
dev/21_02_int8_optim
dev/21_05_int8_optim
main
master
release_candidate
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
gpu
Age
Commit message (
Expand
)
Author
2022-11-25
Implement address precalculation for indirect conv2d - OpenCL
Gian Marco Iodice
2022-11-22
Remove dynamic fusion prototype with tests and examples
SiCong Li
2022-11-14
Optimize Transposed Convolution for CL backend (FP32/16)
Gunes Bayir
2022-11-01
Rework direct convolution heuristic on OpenCL
Gian Marco Iodice
2022-10-06
Rework DepthwiseConvolution heuristic on OpenCL
Gian Marco Iodice
2022-10-06
Improve start-up time in gemmlowp reshaped rhs only.
Adnan AlSinan
2022-10-04
Update GEMM reshaped rhs only heuristic
Gian Marco Iodice
2022-10-03
Force CL kernel compilation with 64 registers
Viet-Hoa Do
2022-09-16
Fix validation in validate_image2d_support_on_rhs
Gian Marco Iodice
2022-09-09
Rework heuristic in ClConv2d
Gian Marco Iodice
2022-09-09
Add a macro guard in all OpenCL kernels in gemmlowp.cl
Gian Marco Iodice
2022-09-02
Enable Winograd-based conv2d when IFM>=8 on Gpu
Gian Marco Iodice
2022-08-17
Revert "Fix performance regression in ClConv2D"
Ramy Elgammal
2022-08-16
Fix performance regression in ClConv2D
Gian Marco Iodice
2022-08-11
Fix performance regression in Conv2D on OpenCL
Adnan AlSinan
2022-08-11
Disable unsafe FP optimizations in Winograd Output Transform
Gunes Bayir
2022-08-05
Fix LeNet-f16 convolution regression
Adnan AlSinan
2022-07-22
Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED
Freddie Liardet
2022-07-22
Update ClConv2D heuristic to use direct convolution
Adnan AlSinan
2022-07-13
Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16
Gunes Bayir
2022-07-08
Extended direct conv 2d interface for tuning the OpenCl kernel
Gian Marco Iodice
2022-06-28
Fix OpenCL Winograd output transform
Gian Marco Iodice
2022-06-15
Fix performance regression in Winograd Output Transform (OpenCL)
Gian Marco Iodice
2022-05-26
Disable unsafe FP optimizations causing accuracy issues
Gunes Bayir
2022-05-11
Fix inclusion guard for dynamic fusion module
SiCong Li
2022-05-06
Integrate Dynamic Fusion patches
SiCong Li
2022-04-19
Add CLPool3d Int8 Support
Mohammed Suhail Munshi
2022-04-14
Enable dynamic cl tuning for dynamically fused kernels
SiCong Li
2022-04-14
Include missing embedded headers
SiCong Li
2022-04-13
Add DirectConvolution2D kernel component for dynamic fusion
Gunes Bayir
2022-03-31
Fix embedded kernel header inclusion for dynamic fusion
Giorgio Arena
2022-03-15
Implementation of ClPooling3d
ramelg01
2022-03-08
Merge kernel prototype patch
Giorgio Arena
2022-02-21
Fix performance regression on Arm(R) Mali(TM)-G71
Gian Marco Iodice
2022-02-11
Improve start-up time for concatenation layers
ramelg01
2022-02-10
Fix performance regression on the first layer of convolution-based model
Gian Marco Iodice
2022-02-10
Improve start-up time for winograd_output_transform_*_nhwc
ramelg01
2022-02-09
Remove deprecated remap functions.
Adnan AlSinan
2022-02-09
Improve start-up time for winograd_input_transform_*_nhwc
ramelg01
2022-02-08
Improve start-up time for winograd_filter_transform_*_nhwc
ramelg01
2022-02-02
Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"
Ramy Elgammal
2022-01-25
Rework gemm_mm_reshaped_only_rhs_ kernels with new macros
Gian Marco Iodice
2022-01-21
Fix heuristic in ClConv2D
Gian Marco Iodice
2022-01-12
Enabled support for QASYMM8 in ClCastKernel
Pablo Marquez Tello
2021-12-25
Add tests for FP Cpu Pooling where pool region is completely outside the input
SiCongLi
2021-12-23
Rework gemm_reshape_lhs_ with new macros
Adnan AlSinan
2021-12-13
Remove padding from ClDirectConv2dKernel
Adnan AlSinan
2021-12-10
Use #if directive instead of regular condition in CLDirectConv2D
Giorgio Arena
2021-12-01
Improve start-up direct convolution on OpenCL
Gian Marco Iodice
2021-11-26
Rework gemm_reshape_rhs_(nt,t) with new macros
Gian Marco Iodice
[next]