author | Gunes Bayir <gunes.bayir@arm.com> | 2024-02-12 21:32:51 +0000 |
---|---|---|
committer | Gunes Bayir <gunes.bayir@arm.com> | 2024-02-21 10:36:22 +0000 |
commit | ef637398a8c2060e15de438020c53331da8bd6dd (patch) | |
tree | b1a1738736c9b6b49e76767e44bf4b77bf732876 /Android.bp | |
parent | 0a48c4c83b598991b4d4235f870c24d9e6634b20 (diff) | |
Integrate new pretranspose_b_array with extra fused transpose of B
This patch fuses the transposition performed in Acl with the transformation done in arm_gemm (called pretranspose_b_array), if the underlying kernel and transform support it. Because this applies to constant Rhs matrices, it should improve start-up time and reduce memory footprint. The transformations in arm_gemm are kernel-specific: the Rhs matrix is rearranged into particular layouts to improve performance.
Resolves: COMPMID-6595
Change-Id: Id2932dd966e59f903c279417bebcea83d9a42464
Signed-off-by: Gunes Bayir <gunes.bayir@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/11144
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Viet-Hoa Do <viet-hoa.do@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
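To illustrate the idea of fusing a transpose into the Rhs pre-transformation, here is a minimal sketch. It is not the arm_gemm implementation: the function names (`transpose`, `pack_panels`, `pack_panels_transposed`) and the simple zero-padded column-panel layout are assumptions made for illustration only, since the real layouts are kernel-specific. The two-step path materialises an intermediate transposed copy of B, while the fused path reads the transposed input directly and writes the packed layout in one pass.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Transpose a row-major rows x cols matrix into a row-major cols x rows matrix.
std::vector<float> transpose(const std::vector<float> &src, std::size_t rows, std::size_t cols)
{
    assert(src.size() == rows * cols);
    std::vector<float> dst(src.size());
    for (std::size_t r = 0; r < rows; ++r)
        for (std::size_t c = 0; c < cols; ++c)
            dst[c * rows + r] = src[r * cols + c];
    return dst;
}

// Pack a row-major K x N matrix into column panels of width `panel`.
// Hypothetical layout: panel p holds, for each k, `panel` consecutive
// columns of row k; columns past N are zero-padded.
std::vector<float> pack_panels(const std::vector<float> &b, std::size_t K, std::size_t N,
                               std::size_t panel)
{
    assert(panel > 0 && b.size() == K * N);
    const std::size_t num_panels = (N + panel - 1) / panel;
    std::vector<float> out(num_panels * K * panel, 0.0f);
    for (std::size_t p = 0; p < num_panels; ++p)
        for (std::size_t k = 0; k < K; ++k)
            for (std::size_t j = 0; j < panel; ++j)
            {
                const std::size_t n = p * panel + j;
                if (n < N)
                    out[(p * K + k) * panel + j] = b[k * N + n];
            }
    return out;
}

// Fused variant: `bt` is the transposed matrix (row-major N x K).
// The transpose is folded into the index computation, so no
// intermediate K x N buffer is allocated.
std::vector<float> pack_panels_transposed(const std::vector<float> &bt, std::size_t K,
                                          std::size_t N, std::size_t panel)
{
    assert(panel > 0 && bt.size() == K * N);
    const std::size_t num_panels = (N + panel - 1) / panel;
    std::vector<float> out(num_panels * K * panel, 0.0f);
    for (std::size_t p = 0; p < num_panels; ++p)
        for (std::size_t k = 0; k < K; ++k)
            for (std::size_t j = 0; j < panel; ++j)
            {
                const std::size_t n = p * panel + j;
                if (n < N)
                    out[(p * K + k) * panel + j] = bt[n * K + k];
            }
    return out;
}
```

In this sketch, `pack_panels(transpose(bt, N, K), K, N, panel)` and `pack_panels_transposed(bt, K, N, panel)` yield the same packed buffer; the fused call skips one full pass over the data and one temporary allocation, which is the kind of start-up time and memory saving the commit message describes.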
Diffstat (limited to 'Android.bp')
-rw-r--r-- | Android.bp | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/Android.bp b/Android.bp
index 670138b209..0d087c943b 100644
--- a/Android.bp
+++ b/Android.bp
@@ -332,6 +332,7 @@ cc_library_static {
         "src/core/NEON/kernels/arm_gemm/gemm_quint8.cpp",
         "src/core/NEON/kernels/arm_gemm/gemm_uint16.cpp",
         "src/core/NEON/kernels/arm_gemm/gemm_uint8.cpp",
+        "src/core/NEON/kernels/arm_gemm/interleave-8way.cpp",
         "src/core/NEON/kernels/arm_gemm/interleave_indirect-sve.cpp",
         "src/core/NEON/kernels/arm_gemm/interleave_indirect.cpp",
         "src/core/NEON/kernels/arm_gemm/mergeresults-fp16.cpp",