diff options
author | Jakub Sujak <jakub.sujak@arm.com> | 2023-02-06 17:42:47 +0000 |
---|---|---|
committer | Jakub Sujak <jakub.sujak@arm.com> | 2023-02-10 12:20:58 +0000 |
commit | 06db85ed4ca0b3f4ee525787cb17a814a89006ce (patch) | |
tree | e31c7650bfac36cb6198e6edca663c9d992c5703 | |
parent | 8ee9425116fafa38b40822ecf43522a3c16e7679 (diff) | |
download | ComputeLibrary-06db85ed4ca0b3f4ee525787cb17a814a89006ce.tar.gz |
Update release version and change log documentation
Partially resolves: COMPMID-5565
Change-Id: I058bb7b0ee2ba246bf0f0c67d68e35b57326e802
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9098
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Omar Al Khatib <omar.alkhatib@arm.com>
Reviewed-by: Pablo Marquez Tello <pablo.tello@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
-rw-r--r-- | docs/user_guide/release_version_and_change_log.dox | 18 |
1 files changed, 18 insertions, 0 deletions
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox index 22adf92d91..d658b5354f 100644 --- a/docs/user_guide/release_version_and_change_log.dox +++ b/docs/user_guide/release_version_and_change_log.dox @@ -41,7 +41,25 @@ If there is more than one release in a month then an extra sequential number is @section S2_2_changelog Changelog v23.02 Public major release + - New features: + - Rework the experimental dynamic fusion interface by identifying auxiliary and intermediate tensors, and specifying an explicit output operator. + - Add the following operators to the experimental dynamic fusion API: + - GpuAdd, GpuCast, GpuClamp, GpuDepthwiseConv2d, GpuMul, GpuOutput, GpuPool2d, GpuReshape, GpuResize, GpuSoftmax, GpuSub. + - Add SME/SME2 kernels for GeMM, Winograd convolution, Depthwise convolution and Pooling. + - Add new flag @ref ITensorInfo::lock_paddings() to tensors to prevent extending tensor paddings. + - Add new OpenCL kernel to compute indirect convolution: + - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink + - Add new OpenCL kernel to compute transposed convolution: + - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink + - Add experimental support for CPU only Bazel and CMake builds. + - Performance optimizations: + - Optimize CPU base-e exponential functions for FP32. + - Optimize CPU StridedSlice by copying first dimension elements in bulk where possible. + - Optimize CPU quantized Subtraction by reusing the quantized Addition kernel. + - Optimize CPU ReduceMean by removing quantization steps and performing the operation in integer domain. + - Optimize GPU Scale and Dynamic Fusion GpuResize by removing quantization steps and performing the operation in integer domain. - Update recommended/minimum NDK version to r20b. + - Various optimizations and bug fixes. v22.11 Public major release - New features: |