author | Giorgio Arena <giorgio.arena@arm.com> | 2021-04-16 17:03:39 +0100 |
---|---|---|
committer | Giorgio Arena <giorgio.arena@arm.com> | 2021-04-20 09:26:59 +0000 |
commit | ada6cbc057ff725e57d301a99a1816ce602485b9 (patch) | |
tree | f869994cb2b061de0bc4731d720336413b81d32a /src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h | |
parent | 031d6a97de79fc3ca3eb6fca1611f03aa9b5893b (diff) | |
download | ComputeLibrary-ada6cbc057ff725e57d301a99a1816ce602485b9.tar.gz |
Remove OpenCL padding: CLPixelWiseMultiplicationKernel
- Change kernel's vec_size to 16 / sizeof(output)
- Change ICLKernel.cpp to handle broadcast without padding
Resolve COMPMID-3913
Signed-off-by: Giorgio Arena <giorgio.arena@arm.com>
Change-Id: I03e884b250ef5784dc109bff8cf2c96b345d119f
Signed-off-by: Giorgio Arena <giorgio.arena@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/5450
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Diffstat (limited to 'src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h')
-rw-r--r-- | src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h | 12 |
1 files changed, 0 insertions, 12 deletions
diff --git a/src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h b/src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h
index e9b3e4a5ef..e1598cb870 100644
--- a/src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h
+++ b/src/runtime/gpu/cl/operators/ClPixelWiseMultiplication.h
@@ -99,12 +99,6 @@ public:
      */
     static Status validate(const ITensorInfo *src1, const ITensorInfo *src2, const ITensorInfo *dst, float scale,
                            ConvertPolicy overflow_policy, RoundingPolicy rounding_policy, const ActivationLayerInfo &act_info = ActivationLayerInfo());
-
-    // Inherited methods overridden:
-    void run(ITensorPack &tensors) override;
-
-private:
-    std::unique_ptr<ICLKernel> _border_handler{ nullptr };
 };
 
 /** Basic function to run @ref opencl::ClComplexPixelWiseMultiplication. */
@@ -132,12 +126,6 @@ public:
      * @param[in] act_info (Optional) Activation layer information in case of a fused activation.
      */
     static Status validate(const ITensorInfo *src1, const ITensorInfo *src2, const ITensorInfo *dst, const ActivationLayerInfo &act_info = ActivationLayerInfo());
-
-    // Inherited methods overridden:
-    void run(ITensorPack &tensors) override;
-
-private:
-    std::unique_ptr<ICLKernel> _border_handler{ nullptr };
 };
 } // namespace opencl
 } // namespace arm_compute
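
The commit message sets the kernel's vec_size to 16 / sizeof(output), which reads as the number of output elements that fit in 16 bytes. The sketch below only illustrates that arithmetic, plus the row tail a kernel has to deal with once it no longer relies on padding. The helper names `vec_size_for` and `leftover` are hypothetical and are not Compute Library functions; how the real kernel treats the tail is not shown in this diff.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper (not a Compute Library API): number of elements of
// the given byte size that fit in a 16-byte vector, i.e. 16 / sizeof(output).
constexpr std::size_t vec_size_for(std::size_t element_size)
{
    return 16 / element_size;
}

// Hypothetical helper: elements remaining when a row of `width` elements is
// processed in chunks of `vec_size`. Without padding, this tail has to be
// handled explicitly instead of being read from padded memory.
constexpr std::size_t leftover(std::size_t width, std::size_t vec_size)
{
    return width % vec_size;
}

// Compile-time checks of the arithmetic for 1-, 2- and 4-byte outputs.
static_assert(vec_size_for(sizeof(std::uint8_t)) == 16, "1-byte output");
static_assert(vec_size_for(sizeof(std::int16_t)) == 8, "2-byte output");
static_assert(vec_size_for(sizeof(std::int32_t)) == 4, "4-byte output");
static_assert(leftover(23, 8) == 7, "23 elements in chunks of 8 leave 7");
static_assert(leftover(32, 16) == 0, "32 elements in chunks of 16 leave none");
```

Under this reading, wider output types get proportionally fewer elements per vector, so the number of bytes handled per work-item stays at 16 regardless of the data type.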