diff options
author | Giorgio Arena <giorgio.arena@arm.com> | 2018-04-23 17:41:22 +0100 |
---|---|---|
committer | Anthony Barbier <anthony.barbier@arm.com> | 2018-11-02 16:52:54 +0000 |
commit | 3695f9af9db2c14acee9af2fd68c44c737faa6ce (patch) | |
tree | 87aa336d6263cb00b01d5277b19178e80782f57a /arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h | |
parent | b62280aca3148dd6762e57e5af3da0cb0a9e2db5 (diff) | |
download | ComputeLibrary-3695f9af9db2c14acee9af2fd68c44c737faa6ce.tar.gz |
COMPMID-1048 Add NHWC data format support to Winograd output transform 4x4_3x3
https://confluence.arm.com/display/MLENG/Winograd+Output+Transform%3A+NCHW+vs+NHWC+on+OpenCL
Change-Id: I6995f5cef759ba70ebd96d545b952041b6f1f36e
Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/128729
Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Tested-by: Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h')
-rw-r--r-- | arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h b/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h index 5e64a82e48..03e3bf5740 100644 --- a/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h +++ b/arm_compute/core/CL/kernels/CLWinogradOutputTransformKernel.h @@ -51,6 +51,7 @@ public: * @note Winograd output transform supports the following configurations: * F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5) * Strides: only unit strides + * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3) * * @param[in] input Source tensor with shape [C, N, K, batches]. Data types supported: F32. * @param[in] bias Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. It can be a nullptr. Data type supported: as @p input @@ -63,6 +64,7 @@ public: * @note Winograd output transform supports the following configurations: * F(output tile, kernel size):F(2x2, 3x3), F(4x4, 3x3), F(4x4, 5x5) * Strides: only unit strides + * Data Layout: NCHW for all configurations, NHWC for F(4x4, 3x3) * * @param[in] input Source tensor with shape [C, N, K, batches]. Data types supported: F32. * @param[in] bias Biases tensor. Shared biases supported. Biases are 1D tensor with dimensions [OFM]. It can be a nullptr. Data type supported: as @p input |