diff options
author | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2023-04-14 12:20:58 +0100 |
---|---|---|
committer | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2023-04-26 11:08:40 +0000 |
commit | 905a3c1a8883d988edf5bdc749844a4565fe5623 (patch) | |
tree | 2a9a98a572cac20ac161a8f8a2003c4bd7e7c6e3 /src/gpu/cl/IClKernel.h | |
parent | b2758f35da97319fd15722485e9b4ba7b35c8cfa (diff) | |
download | ComputeLibrary-905a3c1a8883d988edf5bdc749844a4565fe5623.tar.gz |
Improve Winograd performance on OpenCL
- Performs more output elements per work-item in the case of Fp16
computation in Winograd Input/Output transform
Resolves COMPMID-6018
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Change-Id: If5e6f5182eff8c1f05a3505c437d0a997490f0bd
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9447
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/gpu/cl/IClKernel.h')
0 files changed, 0 insertions, 0 deletions