diff options
author | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2020-06-25 17:18:36 +0100 |
---|---|---|
committer | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2020-06-26 10:15:10 +0000 |
commit | e5563d9b0102846973f144cba42fb9002bebd09b (patch) | |
tree | 3ede792d30aad726a81b371e34bae16f30f5d81c /arm_compute | |
parent | 6cb26ce7ff35e0c9b634160603560feeb23b0cee (diff) | |
download | ComputeLibrary-e5563d9b0102846973f144cba42fb9002bebd09b.tar.gz |
COMPMID-3560: Fix F16 performance regression (OpenCL)
The performance regression was caused by a change in the interface
of the OpenCL kernels gemm_mm_reshaped_lhs_*
Change-Id: I030df4975dc040886c17e71710a27137b50edd9b
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/3465
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Michele Di Giorgio <michele.digiorgio@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'arm_compute')
-rw-r--r-- | arm_compute/core/CL/kernels/CLGEMMMatrixMultiplyReshapedKernel.h | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/arm_compute/core/CL/kernels/CLGEMMMatrixMultiplyReshapedKernel.h b/arm_compute/core/CL/kernels/CLGEMMMatrixMultiplyReshapedKernel.h index aeedd50e0b..52bc7c45ce 100644 --- a/arm_compute/core/CL/kernels/CLGEMMMatrixMultiplyReshapedKernel.h +++ b/arm_compute/core/CL/kernels/CLGEMMMatrixMultiplyReshapedKernel.h @@ -182,6 +182,7 @@ private: bool _add_bias; bool _broadcast_bias; bool _export_to_cl_image; + unsigned int _k; }; } // namespace arm_compute #endif /*ARM_COMPUTE_CLGEMMMATRIXMULTIPLYRESHAPEDKERNEL_H*/
\ No newline at end of file |