author | Mohammed Suhail Munshi <MohammedSuhail.Munshi@arm.com> | 2022-01-19 12:22:50 +0000 |
---|---|---|
committer | SiCong Li <sicong.li@arm.com> | 2022-01-21 10:56:35 +0000 |
commit | 066607f1384f502612869196c97b17ed0fc4caf3 (patch) | |
tree | 0a5d37650e551c4f3c49fd7e623498bd02c8ebdd /src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h | |
parent | d5c496d87e3b446532dd3dd163e9768de0daff4e (diff) | |
download | ComputeLibrary-066607f1384f502612869196c97b17ed0fc4caf3.tar.gz |
A73 Devices Regression 300% fix
- Performance currently regresses on A73 devices (tested on Android hikey with inceptionv3); this patch fixes the regression
- Changed mws (minimum workload size) for all cores to use the default value
- The existing mws value for A73 was tuned for hikey-linux and caused a regression on hikey-android
Resolves [COMPMID-5044]
Change-Id: Ifd6faaa34a0b405d0c390015566f2c75436dfb07
Signed-off-by: Mohammed Suhail Munshi <MohammedSuhail.Munshi@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/6973
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: SiCong Li <sicong.li@arm.com>
Diffstat (limited to 'src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h')
-rw-r--r-- | src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h | 18 |
1 files changed, 4 insertions, 14 deletions
diff --git a/src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h b/src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h
index 212fd79306..10bf8e4ff7 100644
--- a/src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h
+++ b/src/cpu/kernels/assembly/CpuGemmAssemblyWrapperKernel.h
@@ -1,5 +1,5 @@
 /*
- * Copyright (c) 2018-2021 Arm Limited.
+ * Copyright (c) 2018-2022 Arm Limited.
  *
  * SPDX-License-Identifier: MIT
  *
@@ -125,19 +125,9 @@ public:
     size_t get_mws(const CPUInfo &platform, size_t thread_count) const override
     {
         ARM_COMPUTE_UNUSED(thread_count);
-        // Tuning results that gave optimized results in performance investigation
-        if (platform.get_cpu_model() == CPUModel::A73 )
-        {
-            return 3072;
-        }
-        else if (platform.get_cpu_model() == CPUModel::A76)
-        {
-            return 4096;
-        }
-        else
-        {
-            return ICPPKernel::default_mws;
-        }
+        ARM_COMPUTE_UNUSED(platform);
+
+        return ICPPKernel::default_mws;
     }
 
 private:
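To illustrate why a tuned mws value can help on one platform and hurt on another, here is a minimal standalone sketch. It does not use the ComputeLibrary types: `CpuModel`, `kDefaultMws`, `mws_before`, `mws_after`, and `usable_threads` are hypothetical names introduced for illustration. The placeholder value of `kDefaultMws` and the way the scheduler caps the thread count are assumptions, not taken from this patch. Only the constants 3072 and 4096 come from the diff.

```cpp
#include <algorithm>
#include <cstddef>

// Hypothetical stand-in for the library's CPU model enum.
enum class CpuModel { Generic, A73, A76 };

// Assumed placeholder for ICPPKernel::default_mws; the real value is not shown in the diff.
constexpr std::size_t kDefaultMws = 1;

// Mirrors the removed branch: per-core tuned minimum workload sizes.
std::size_t mws_before(CpuModel model)
{
    switch (model)
    {
        case CpuModel::A73:
            return 3072;
        case CpuModel::A76:
            return 4096;
        default:
            return kDefaultMws;
    }
}

// Mirrors the patched code: every core gets the default value.
std::size_t mws_after(CpuModel /*model*/)
{
    return kDefaultMws;
}

// Assumed scheduling rule (illustrative only): never give a thread less than
// `mws` units of work, and never use more than `max_threads` threads.
std::size_t usable_threads(std::size_t total_work, std::size_t mws, std::size_t max_threads)
{
    const std::size_t safe_mws = std::max<std::size_t>(1, mws);
    const std::size_t by_work  = std::max<std::size_t>(1, total_work / safe_mws);
    return std::min(by_work, max_threads);
}
```

Under this assumed rule, a workload of 8192 units on an 8-thread A73 would be limited to 2 threads with the old value of 3072, while the default lets the scheduler use all 8. Whether fewer, larger chunks are faster depends on the platform, which is consistent with a value tuned on hikey-linux regressing on hikey-android.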