aboutsummaryrefslogtreecommitdiff
path: root/arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h
diff options
context:
space:
mode:
authorGiorgio Arena <giorgio.arena@arm.com>2018-04-26 11:33:05 +0100
committerAnthony Barbier <anthony.barbier@arm.com>2018-11-02 16:53:09 +0000
commitc42f28d45e9b990276d54880d2cee9c9ee675a41 (patch)
tree5b407f4cc8abb67ca3c9f95c1f59e3f79859495a /arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h
parent376c85f3d826526b8b197c55e22c10765a97631e (diff)
downloadComputeLibrary-c42f28d45e9b990276d54880d2cee9c9ee675a41.tar.gz
COMPMID-1048 Add NHWC data format support to Winograd input transform 4x4_3x3
https://confluence.arm.com/display/MLENG/Winograd+Input+Transform%3A+NCHW+vs+NHWC+on+OpenCL Change-Id: Iac35a54389266701b7d8f5434a7a37df85b7b187 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/133315 Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Tested-by: Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h')
-rw-r--r--arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h2
1 files changed, 2 insertions, 0 deletions
diff --git a/arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h b/arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h
index b92ff2f60c..58e8291161 100644
--- a/arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h
+++ b/arm_compute/core/CL/kernels/CLWinogradInputTransformKernel.h
@@ -49,6 +49,7 @@ public:
* @note Winograd input transform supports the following configurations:
* F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5)
* Strides: only unit strides
+ * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3)
*
* @param[in] input The input tensor to transform. Data types supported: F32
* @param[in] output The output tensor. The shape for this tensor can be calculated using the utility function @p compute_winograd_input_transform_shape. Data types supported: Same as @p input
@@ -60,6 +61,7 @@ public:
* @note Winograd input transform supports the following configurations:
* F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5)
* Strides: only unit strides
+ * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3)
*
* @param[in] input The input tensor to transform. Data types supported: F32
* @param[in] output The output tensor. The shape for this tensor can be calculated using the utility function @p compute_winograd_input_transform_shape. Data types supported: Same as @p input