aboutsummaryrefslogtreecommitdiff
path: root/arm_compute/runtime/CL/functions/CLStackLayer.h
diff options
context:
space:
mode:
authorGian Marco Iodice <gianmarco.iodice@arm.com>2021-04-30 10:16:47 +0100
committerGeorgios Pinitas <georgios.pinitas@arm.com>2021-04-30 13:40:40 +0000
commit50929ef951880469b9d579323d2f9c9f5025327d (patch)
treeac16f522961ee95ed5da8e5d54d53fe6f8420bcf /arm_compute/runtime/CL/functions/CLStackLayer.h
parente5df1d51177c0682622426c97de620c5ad05d341 (diff)
downloadComputeLibrary-50929ef951880469b9d579323d2f9c9f5025327d.tar.gz
Add optimization for global pooling in pooling_layer.cl
- Simplify the implementation when the pooling size has the same spatial dimensions of the input tensor - Rework the heuristic for F32/F16 - Add test for validating the global pooling path - Fix compare_dimensions in validation. The validation fails because we have different number of dimensions for NCHW and NHWC (e.g. 1,1,2,1(NCHW) -> 2,1,1,1(NHWC) Resolves COMPMID-4426 Change-Id: Ia53ee659a9fbc3d011f286a8150d1be9d6d2cd05 Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/5533 Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Reviewed-by: TeresaARM <teresa.charlinreyes@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'arm_compute/runtime/CL/functions/CLStackLayer.h')
0 files changed, 0 insertions, 0 deletions