author: Gian Marco Iodice <gianmarco.iodice@arm.com> 2023-04-14 12:20:58 +0100
committer: Gian Marco Iodice <gianmarco.iodice@arm.com> 2023-04-26 11:08:40 +0000
commit: 905a3c1a8883d988edf5bdc749844a4565fe5623
tree: 2a9a98a572cac20ac161a8f8a2003c4bd7e7c6e3 /src/runtime/CL
parent: b2758f35da97319fd15722485e9b4ba7b35c8cfa
Improve Winograd performance on OpenCL
- Performs more output elements per work-item in the case of Fp16 computation in Winograd Input/Output transform

Resolves COMPMID-6018

Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Change-Id: If5e6f5182eff8c1f05a3505c437d0a997490f0bd
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9447
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/runtime/CL')
0 files changed, 0 insertions, 0 deletions