author | Jonathan Deakin <jonathan.deakin@arm.com> | 2024-01-15 14:08:21 +0000 |
---|---|---|
committer | Jonathan Deakin <jonathan.deakin@arm.com> | 2024-02-05 10:22:53 +0000 |
commit | 0c17c4b42e6b819b0140f36527dbaf6166f37b2b (patch) | |
tree | 053dfedb4952b2fc1c248a3b9085e65709584266 /tests/validation/NEON | |
parent | 9b72a6c362abe5e5c23fa6a8bcca4ff4339fb225 (diff) | |
download | ComputeLibrary-0c17c4b42e6b819b0140f36527dbaf6166f37b2b.tar.gz |
Fix leftover cols in CpuGemmLowpMatrixBReductionKernel

CpuGemmLowpMatrixBReductionKernel::run_internal randomly segfaults
because it reads out of bounds with vloadq. The unit tests don't
trigger this because the stray read still falls within memory that
belongs to the process, but it can be seen clearly by running the
following in debug mode:

    ./examples/neon_gemm_qasymm8 1 1 1

The vloadq at src/cpu/kernels/CpuGemmLowpMatrixReductionKernel.cpp:353
loads a full quadword (16 bytes) even though the input is a single byte.
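
For illustration only, the leftover-column handling can be sketched in
portable C++. This is not the code from this change: column_sums is a
hypothetical helper, the matrix is assumed to be row-major uint8_t, and
the inner 16-element loop merely stands in for the quadword load that
the real kernel does with vloadq. The point is that the 16-wide path
runs only while 16 full columns remain, and the remaining columns are
read one byte at a time.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper, not ComputeLibrary code: sums each column of a
// row-major (height x width) matrix of uint8_t values.
std::vector<int32_t> column_sums(const std::vector<uint8_t> &b, std::size_t width, std::size_t height)
{
    constexpr std::size_t block = 16; // bytes covered by one quadword load
    std::vector<int32_t>  sums(width, 0);

    std::size_t x = 0;

    // Full blocks: all 16 columns exist, so reading 16 bytes stays inside the row.
    for(; x + block <= width; x += block)
    {
        for(std::size_t y = 0; y < height; ++y)
        {
            const uint8_t *row = b.data() + y * width + x;
            // Stands in for one vector load followed by widening adds.
            for(std::size_t i = 0; i < block; ++i)
            {
                sums[x + i] += row[i];
            }
        }
    }

    // Leftover columns: fewer than 16 remain, so read a single byte per element.
    for(; x < width; ++x)
    {
        for(std::size_t y = 0; y < height; ++y)
        {
            sums[x] += b[y * width + x];
        }
    }

    return sums;
}
```

With width 1 and height 1, as in the reproducer above, the 16-wide loop
never executes and only the single input byte is read.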
relates to: ONCPUML-1444 MLINFSW-439 COMPMID-6844
Change-Id: I2ae5260c9f38d6d8149a6bcd5dc146b911209784
Signed-off-by: Jonathan Deakin <jonathan.deakin@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/10966
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Jakub Sujak <jakub.sujak@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>