aboutsummaryrefslogtreecommitdiff
path: root/src/gpu
diff options
context:
space:
mode:
authorGunes Bayir <gunes.bayir@arm.com>2023-12-13 11:51:05 +0000
committerGunes Bayir <gunes.bayir@arm.com>2023-12-14 17:10:49 +0000
commitfeef9b98c0cdc9df773fdf2d7b17496e0fa269a0 (patch)
tree94a772f79d3dc3dd8c2b9b81be9f024709f138c0 /src/gpu
parentb7e7e1a1ea5c8e176f2099990ecd97acee5eb8ca (diff)
downloadComputeLibrary-feef9b98c0cdc9df773fdf2d7b17496e0fa269a0.tar.gz
Fix validation error in CL generate proposals kernel
This fix modifies some of the conversions done in the generate proposals kernel that causes DDK issues while compiling the kernel. The issues are mostly related to conversion from i64 to fp16, and it doesn't affect fp32. Firstly, type identifier size_t is converted into unsigned int. But, this alone was compiling but causing mismatches, even in older devices, where it was passing before. Therefore, the fp16 conversion delayed until vector construction where the integers are now converted to fp32, and then fp16. This, although may not be ideal, seems like the best solution. Resolves: COMPMID-6756 Signed-off-by: Gunes Bayir <gunes.bayir@arm.com> Change-Id: Iee61216c908fe51431985b80c3653fc32add4741 Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/10879 Benchmark: Arm Jenkins <bsgcomp@arm.com> Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: Jakub Sujak <jakub.sujak@arm.com> Reviewed-by: Pablo Marquez Tello <pablo.tello@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/gpu')
0 files changed, 0 insertions, 0 deletions