author | Freddie Liardet <frederick.liardet@arm.com> | 2022-05-16 14:09:10 +0100 |
---|---|---|
committer | Gunes Bayir <gunes.bayir@arm.com> | 2022-07-22 10:18:41 +0000 |
commit | e572dff7adc334a98ac4a0326d66037451d5d079 (patch) | |
tree | 9c4db3d743078de9bda67dfed674e3f371a4e238 /Android.bp | |
parent | e87120731ca65c54b082734af07f748ac9651427 (diff) | |
Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNED
This patch introduces a GEMMLowp routine that is optimized for Arm(R) Mali(TM)-G715 and Arm(R) Mali(TM)-G615
Resolves: COMPMID-5398
Signed-off-by: Freddie Liardet <frederick.liardet@arm.com>
Signed-off-by: Gunes Bayir <gunes.bayir@arm.com>
Change-Id: I8d06453645688f3658b6c7c06f1ebc25a2505661
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/7932
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: SiCong Li <sicong.li@arm.com>
Reviewed-by: Pablo Marquez Tello <pablo.tello@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'Android.bp')
-rw-r--r-- | Android.bp | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/Android.bp b/Android.bp
index ad28cc35b3..4a6ba4f3ab 100644
--- a/Android.bp
+++ b/Android.bp
@@ -43,6 +43,7 @@ opencl_srcs = [
 "src/core/CL/cl_kernels/common/gemm_reshaped_only_rhs_mmul.cl",
 "src/core/CL/cl_kernels/common/gemm_utils.cl",
 "src/core/CL/cl_kernels/common/gemmlowp.cl",
+"src/core/CL/cl_kernels/common/gemmlowp_reshaped_only_rhs_mmul.cl",
 "src/core/CL/cl_kernels/common/gemv.cl",
 "src/core/CL/cl_kernels/common/generate_proposals.cl",
 "src/core/CL/cl_kernels/common/generate_proposals_quantized.cl",
@@ -611,6 +612,7 @@ cc_library_static {
 "src/gpu/cl/kernels/ClGemmLowpMatrixMultiplyNativeKernel.cpp",
 "src/gpu/cl/kernels/ClGemmLowpMatrixMultiplyReshapedKernel.cpp",
 "src/gpu/cl/kernels/ClGemmLowpMatrixMultiplyReshapedOnlyRhsKernel.cpp",
+"src/gpu/cl/kernels/ClGemmLowpMatrixMultiplyReshapedOnlyRhsMMULKernel.cpp",
 "src/gpu/cl/kernels/ClGemmLowpOffsetContributionKernel.cpp",
 "src/gpu/cl/kernels/ClGemmLowpOffsetContributionOutputStageKernel.cpp",
 "src/gpu/cl/kernels/ClGemmLowpQuantizeDownInt32ScaleByFixedPointKernel.cpp",
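For readers unfamiliar with the term in the commit message, the following is a minimal reference sketch of the arithmetic a low-precision GEMM on asymmetric-quantized inputs generally performs: zero points are removed and products are accumulated in wide integers. It is not the OpenCL kernel added by this patch and does not model the RHS reshaping or the MMUL path. The function name `gemmlowp_reference` and the convention `real = scale * (q - offset)` are assumptions made for illustration only.

```python
def gemmlowp_reference(lhs_q, rhs_q, lhs_offset, rhs_offset):
    """Quantized matrix multiply with wide integer accumulation.

    lhs_q, rhs_q: lists of rows holding quantized values
    (0..255 for a QASYMM8-style type, -128..127 for a QASYMM8_SIGNED-style type).
    lhs_offset, rhs_offset: zero points, assuming real = scale * (q - offset).
    Hypothetical helper for illustration; not part of ComputeLibrary.
    """
    rows, depth, cols = len(lhs_q), len(rhs_q), len(rhs_q[0])
    acc = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            for k in range(depth):
                # Remove the zero points first, then accumulate the product.
                acc[i][j] += (lhs_q[i][k] - lhs_offset) * (rhs_q[k][j] - rhs_offset)
    return acc


# Unsigned example: offsets 128 and 10 map the inputs to small signed values.
lhs_q = [[130, 128], [127, 129]]
rhs_q = [[12, 10], [9, 11]]
acc = gemmlowp_reference(lhs_q, rhs_q, lhs_offset=128, rhs_offset=10)
```

The accumulators would normally be requantized to 8 bits by a separate output stage; that step is omitted here.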