aboutsummaryrefslogtreecommitdiff
path: root/src/common/cpuinfo/CpuIsaInfo.cpp
diff options
context:
space:
mode:
authorGunes Bayir <gunes.bayir@arm.com>2024-03-04 14:55:24 +0000
committerGunes Bayir <gunes.bayir@arm.com>2024-03-04 16:37:14 +0000
commitbf053734c468ebc9fd2e535ab8c357b55fdaad43 (patch)
tree85a292b8a31c3bec2c3f4d32a5cec6ac48741a36 /src/common/cpuinfo/CpuIsaInfo.cpp
parent6fe9eafe0707387e65f9b3c188f4145f64415ce3 (diff)
downloadComputeLibrary-bf053734c468ebc9fd2e535ab8c357b55fdaad43.tar.gz
Fix performance regression in fixed-format kernels
Fix the performance regression in CpuGemmConv2d caused by importing memory at every run for fixed-format kernels. This has been done by adding an bypass_import parameter to the aux. tensor handler class (CpuAuxTensorHandler) and using it in CpuGemmConv2d so that the memory import happens if and only when the associated tensor is used in the gemm pack. Also, improve the documentation of CpuAuxTensorHandler. Resolves: ARMCL-1126 Co-authored by: SiCong Li <sicong.li@arm.com> Change-Id: Idb26bdb2d19419074a6e7f2497a1741ae200603f Signed-off-by: Gunes Bayir <gunes.bayir@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/11240 Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: Pablo Marquez Tello <pablo.tello@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com> Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/common/cpuinfo/CpuIsaInfo.cpp')
0 files changed, 0 insertions, 0 deletions