diff options
author | Michele Di Giorgio <michele.digiorgio@arm.com> | 2021-08-26 14:47:55 +0100 |
---|---|---|
committer | Michele Di Giorgio <michele.digiorgio@arm.com> | 2021-09-01 22:11:21 +0000 |
commit | 50335fd3d0734157382741fcf1bfdaf630c60c4b (patch) | |
tree | aa6a8e65cbd2245b84d5ca9c4df7ff5f6735e982 /src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h | |
parent | e2276cb18f9dffd484bc5d11f65f8526fb32d5bd (diff) | |
download | ComputeLibrary-50335fd3d0734157382741fcf1bfdaf630c60c4b.tar.gz |
Remove padding from ClGemmMatrixMultiplyReshapedKernel
Create new macros for loading values from memory while being aware of
boundaries of the tensor to not generate page faults.
Resolves: COMPMID-4447
Change-Id: If9a455291e395ebd9070ebe5e120b3064d8fab29
Signed-off-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/6168
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h')
-rw-r--r-- | src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h | 2 |
1 files changed, 1 insertions, 1 deletions
diff --git a/src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h b/src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h index 435a3a67f6..b320d318e9 100644 --- a/src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h +++ b/src/gpu/cl/kernels/ClGemmMatrixMultiplyReshapedKernel.h @@ -84,7 +84,7 @@ public: * @note lhs_info.k0 must be equal to rhs_info.k0 */ void configure(const ClCompileContext &compile_context, - ITensorInfo *src0, ITensorInfo *src1, ITensorInfo *src2, ITensorInfo *dst, float alpha, float beta, + const ITensorInfo *src0, const ITensorInfo *src1, const ITensorInfo *src2, ITensorInfo *dst, float alpha, float beta, const GEMMLHSMatrixInfo &lhs_info, const GEMMRHSMatrixInfo &rhs_info, const GEMMKernelInfo &gemm_info); /** Static function to check if given info will lead to a valid configuration * |