index
:
ComputeLibrary.git
branches/arm_compute_19_02
branches/arm_compute_19_05
branches/arm_compute_19_08
branches/arm_compute_19_11
branches/arm_compute_20_02
branches/arm_compute_20_05
branches/arm_compute_20_08
branches/arm_compute_20_11
branches/arm_compute_21_02
branches/arm_compute_21_05
branches/arm_compute_21_08
branches/arm_compute_21_11
branches/arm_compute_22_02
branches/arm_compute_22_05
branches/arm_compute_22_08
branches/arm_compute_22_11
branches/arm_compute_23_02
branches/arm_compute_23_02_1
branches/arm_compute_23_05
branches/arm_compute_23_05_1
branches/arm_compute_23_08
branches/arm_compute_23_11
branches/arm_compute_24_01
branches/arm_compute_24_02
branches/arm_compute_24_02_1
branches/arm_compute_24_04
branches/arm_compute_24_05
branches/arm_compute_24_06
branches/arm_compute_24_07
dev/21_02_int8_optim
dev/21_05_int8_optim
main
master
release_candidate
about
summary
refs
log
tree
commit
diff
log msg
author
committer
range
path:
root
/
src
/
core
/
CL
/
cl_kernels
/
nhwc
Age
Commit message (
Expand
)
Author
2023-01-18
Revert "Update the heuristic for CLDepthwiseConvolutionNative kernel"
Gian Marco Iodice
2023-01-12
Update the heuristic for CLDepthwiseConvolutionNative kernel
Gian Marco Iodice
2023-01-10
Extend cl image support to input and output tensors
Gian Marco Iodice
2022-12-29
Optimize CL Scale/Resize Quantized by removing (de)quant. code
Gunes Bayir
2022-12-29
Extend Transposed Conv. for tiles with N0>1
Gunes Bayir
2022-12-21
Update direct conv2d kernel in dynamic fusion
Gian Marco Iodice
2022-12-14
Optimize Transposed Convolution for CL backend (Quantized)
Gunes Bayir
2022-12-09
Implement the OpenCL kernel to compute the indirect convolution
Gian Marco Iodice
2022-11-25
Implement address precalculation for indirect conv2d - OpenCL
Gian Marco Iodice
2022-11-14
Optimize Transposed Convolution for CL backend (FP32/16)
Gunes Bayir
2022-11-01
Rework direct convolution heuristic on OpenCL
Gian Marco Iodice
2022-10-07
Workaround CL compiler issue on FP16
Viet-Hoa Do
2022-10-06
Rework DepthwiseConvolution heuristic on OpenCL
Gian Marco Iodice
2022-09-07
Optimize depthwise convolution on OpenCL
Gian Marco Iodice
2022-07-21
Fix direct convolution cases that were failing on Odroid
Adnan AlSinan
2022-06-27
Implement new Elementwise Dynamic Fusion Operators: Div, Floor
Michalis Spyrou
2022-06-15
Fix performance regression in Winograd Output Transform (OpenCL)
Gian Marco Iodice
2022-04-19
Add CLPool3d Int8 Support
Mohammed Suhail Munshi
2022-03-15
Implementation of ClPooling3d
ramelg01
2022-02-10
Improve start-up time for winograd_output_transform_*_nhwc
ramelg01
2022-02-09
Remove deprecated remap functions.
Adnan AlSinan
2022-02-09
Improve start-up time for winograd_input_transform_*_nhwc
ramelg01
2022-02-08
Improve start-up time for winograd_filter_transform_*_nhwc
ramelg01
2021-12-10
Use #if directive instead of regular condition in CLDirectConv2D
Giorgio Arena
2021-12-01
Improve start-up direct convolution on OpenCL
Gian Marco Iodice
2021-11-29
Use loop unrolling only when the kernel height is less than 5
Gian Marco Iodice
2021-11-17
Improve start-up timer for ClIm2Col
Giorgio Arena
2021-11-17
Improve start-up time for depthwise convolution
Sheri Zhang
2021-11-09
Improve start-up time for ClScale
Adnan AlSinan
2021-10-20
Implement CLDirectConv3DKernel - uint8/int8
Giorgio Arena
2021-10-15
Fix CLConv3D filelist and comments
Giorgio Arena
2021-10-14
Implement CLDirectConv3D f32/f16
Giorgio Arena
2021-09-14
Optimize ClScaleKernel on NHWC (f32/f16/int8)
Gian Marco Iodice
2021-08-23
Remove padding from ClScaleKernel
Giorgio Arena
2021-07-25
Reorganize the kernels into nhwc, nchw and common folders
Adnan AlSinan