aboutsummaryrefslogtreecommitdiff
path: root/docs/user_guide
diff options
context:
space:
mode:
authorJonathan Deakin <jonathan.deakin@arm.com>2024-02-05 15:32:31 +0000
committerJonathan Deakin <jonathan.deakin@arm.com>2024-02-07 09:15:39 +0000
commit2db938cd1d026deb3689c668dd7031c00b9b339d (patch)
treea1f046ae6a53cddf3772a895107be9457f3359cd /docs/user_guide
parente695579911fbe6aa06b11dbeeec7af5637a92f2b (diff)
downloadComputeLibrary-2db938cd1d026deb3689c668dd7031c00b9b339d.tar.gz
Parallelize CPU depthwise over batch if only 1 row
This patch also fixes a bug where the split dimension was wrong in CpuDepthwiseConv2dAssemblyDispatch::run. It was set to DimY, which is cols, but it should have been DimZ. This was rarely an issue in practice because typically the number of cols are greater than the number of threads anyway. Relates to: ONCPUML-1443 Co-authored-by: Milos Puzovic <Milos.Puzovic@arm.com> Change-Id: Ifed2fce22ddeb7cd77e6a6ae1083694427f91e04 Signed-off-by: Jonathan Deakin <jonathan.deakin@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/11083 Benchmark: Arm Jenkins <bsgcomp@arm.com> Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: Jakub Sujak <jakub.sujak@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs/user_guide')
-rw-r--r--docs/user_guide/release_version_and_change_log.dox6
1 files changed, 5 insertions, 1 deletions
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox
index 40ad09fd84..b1a4395eb5 100644
--- a/docs/user_guide/release_version_and_change_log.dox
+++ b/docs/user_guide/release_version_and_change_log.dox
@@ -1,5 +1,5 @@
///
-/// Copyright (c) 2017-2023 Arm Limited.
+/// Copyright (c) 2017-2024 Arm Limited.
///
/// SPDX-License-Identifier: MIT
///
@@ -41,6 +41,10 @@ If there is more than one release in a month then an extra sequential number is
@section S2_2_changelog Changelog
+v24.02 Public major release
+ - Performance optimizations:
+ - Parallelize @ref NEDepthwiseConvolutionLayer over batches if there is only 1 row
+
v24.01 Public major release
- Remove the legacy 'libarm_compute_core' library. This library is an artifact of Compute Library's legacy library architecture and no longer serves any purpose.
You should link only to the main `libarm_compute` library for core functionality.