author: Jakub Sujak <jakub.sujak@arm.com> 2023-02-06 17:42:47 +0000
committer: Jakub Sujak <jakub.sujak@arm.com> 2023-02-10 14:38:34 +0000
commit: c7799a79a0dc2597423ea4733dbbd6280c0b5086
tree: 9b4a895f1d9368b5939bc8727a5102b6361e0838
parent: 56a813665b962c4f7c0b2104a8ade7045c851311
Update release version and change log documentation
Partially resolves: COMPMID-5565
Change-Id: I058bb7b0ee2ba246bf0f0c67d68e35b57326e802
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9098
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Omar Al Khatib <omar.alkhatib@arm.com>
Reviewed-by: Pablo Marquez Tello <pablo.tello@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
-rw-r--r-- docs/user_guide/release_version_and_change_log.dox 18
1 files changed, 18 insertions, 0 deletions
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox
index 22adf92d91..d658b5354f 100644
--- a/docs/user_guide/release_version_and_change_log.dox
+++ b/docs/user_guide/release_version_and_change_log.dox
@@ -41,7 +41,25 @@ If there is more than one release in a month then an extra sequential number is
@section S2_2_changelog Changelog
v23.02 Public major release
+ - New features:
+ - Rework the experimental dynamic fusion interface by identifying auxiliary and intermediate tensors, and specifying an explicit output operator.
+ - Add the following operators to the experimental dynamic fusion API:
+ - GpuAdd, GpuCast, GpuClamp, GpuDepthwiseConv2d, GpuMul, GpuOutput, GpuPool2d, GpuReshape, GpuResize, GpuSoftmax, GpuSub.
+ - Add SME/SME2 kernels for GeMM, Winograd convolution, Depthwise convolution and Pooling.
+ - Add new flag @ref ITensorInfo::lock_paddings() to tensors to prevent extending tensor paddings.
+ - Add new OpenCL kernel to compute indirect convolution:
+ - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink
+ - Add new OpenCL kernel to compute transposed convolution:
+ - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink
+ - Add experimental support for CPU only Bazel and CMake builds.
+ - Performance optimizations:
+ - Optimize CPU base-e exponential functions for FP32.
+ - Optimize CPU StridedSlice by copying first dimension elements in bulk where possible.
+ - Optimize CPU quantized Subtraction by reusing the quantized Addition kernel.
+ - Optimize CPU ReduceMean by removing quantization steps and performing the operation in integer domain.
+ - Optimize GPU Scale and Dynamic Fusion GpuResize by removing quantization steps and performing the operation in integer domain.
- Update recommended/minimum NDK version to r20b.
+ - Various optimizations and bug fixes.
v22.11 Public major release
- New features: