aboutsummaryrefslogtreecommitdiff
path: root/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h
diff options
context:
space:
mode:
authorGiorgio Arena <giorgio.arena@arm.com>2018-04-23 17:41:22 +0100
committerAnthony Barbier <anthony.barbier@arm.com>2018-11-02 16:52:54 +0000
commit3695f9af9db2c14acee9af2fd68c44c737faa6ce (patch)
tree87aa336d6263cb00b01d5277b19178e80782f57a /arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h
parentb62280aca3148dd6762e57e5af3da0cb0a9e2db5 (diff)
downloadComputeLibrary-3695f9af9db2c14acee9af2fd68c44c737faa6ce.tar.gz
COMPMID-1048 Add NHWC data format support to Winograd output transform 4x4_3x3
https://confluence.arm.com/display/MLENG/Winograd+Output+Transform%3A+NCHW+vs+NHWC+on+OpenCL Change-Id: I6995f5cef759ba70ebd96d545b952041b6f1f36e Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/128729 Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Tested-by: Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h')
-rw-r--r--arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h2
1 files changed, 2 insertions, 0 deletions
diff --git a/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h b/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h
index 5e64a82e48..03e3bf5740 100644
--- a/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h
+++ b/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h
@@ -51,6 +51,7 @@ public:
* @note Winograd output transform supports the following configurations:
* F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5)
* Strides: only unit strides
+ * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3)
*
* @param[in] input Source tensor with shape [C, N, K, batches]. Data types supported: F32.
* @param[in] bias Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. It can be a nullptr. Data type supported: as @p input
@@ -63,6 +64,7 @@ public:
* @note Winograd output transform supports the following configurations:
* F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5)
* Strides: only unit strides
+ * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3)
*
* @param[in] input Source tensor with shape [C, N, K, batches]. Data types supported: F32.
* @param[in] bias Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. It can be a nullptr. Data type supported: as @p input