path: root/src/core/CL/kernels/CLDepthwiseConvolutionLayerNativeKernel.h
author: Gian Marco Iodice <gianmarco.iodice@arm.com> 2021-04-01 16:17:16 +0100
committer: Gian Marco Iodice <gianmarco.iodice@arm.com> 2021-04-08 10:00:11 +0000
commit: 534b889482967a4b4e7d6443bad4e4bdcb4999d4
tree: 173890ba83eb6ce24266304c983a347b4d3fccc2 /src/core/CL/kernels/CLDepthwiseConvolutionLayerNativeKernel.h
parent: 68508897deafe26b5d50566a6ca3ba70c728dd12

Rework the OpenCL Winograd Input Transformations NHWC

- Rework Winograd Input Transform 3x3 NHWC using the new macros
- Rework Winograd Input Transform 5x5 NHWC using the new macros
- Rework Winograd Input Transform 7x7 NHWC using the new macros
- The new implementation is also faster than before
- Winograd Input Transform 5x5/7x7 3x faster

Resolves COMPMID-4139

Change-Id: Ia9c8af23a2d47d2db60ec4c44650a63a34ffa0d5
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/5358
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>

Diffstat (limited to 'src/core/CL/kernels/CLDepthwiseConvolutionLayerNativeKernel.h')
0 files changed, 0 insertions, 0 deletions