diff options
author | Giorgio Arena <giorgio.arena@arm.com> | 2021-04-16 17:03:39 +0100 |
---|---|---|
committer | Giorgio Arena <giorgio.arena@arm.com> | 2021-04-20 09:26:59 +0000 |
commit | ada6cbc057ff725e57d301a99a1816ce602485b9 (patch) | |
tree | f869994cb2b061de0bc4731d720336413b81d32a /src/core/CL/ICLKernel.cpp | |
parent | 031d6a97de79fc3ca3eb6fca1611f03aa9b5893b (diff) | |
download | ComputeLibrary-ada6cbc057ff725e57d301a99a1816ce602485b9.tar.gz |
Remove OpenCL padding: CLPixelWiseMultiplicationKernel
- Change kernel's vec_size to 16 / sizeof(output)
- Change ICLKernel.cpp to handle broadcast without padding
Resolve COMPMID-3913
Signed-off-by: Giorgio Arena <giorgio.arena@arm.com>
Change-Id: I03e884b250ef5784dc109bff8cf2c96b345d119f
Signed-off-by: Giorgio Arena <giorgio.arena@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/5450
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Diffstat (limited to 'src/core/CL/ICLKernel.cpp')
-rw-r--r-- | src/core/CL/ICLKernel.cpp | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/src/core/CL/ICLKernel.cpp b/src/core/CL/ICLKernel.cpp index 1c6963f3f1..9ba17d0e03 100644 --- a/src/core/CL/ICLKernel.cpp +++ b/src/core/CL/ICLKernel.cpp @@ -105,8 +105,8 @@ void ICLKernel::add_tensor_argument(unsigned &idx, const ICLTensor *tensor, cons for(unsigned int d = 0; d < dimension_size; ++d) { - _kernel.setArg<cl_uint>(idx++, strides[d]); - _kernel.setArg<cl_uint>(idx++, strides[d] * window[d].step()); + _kernel.setArg<cl_uint>(idx++, window.is_broadcasted(d) ? 0 : strides[d]); + _kernel.setArg<cl_uint>(idx++, window.is_broadcasted(d) ? 0 : (strides[d] * window[d].step())); } _kernel.setArg<cl_uint>(idx++, offset_first_element); |