aboutsummaryrefslogtreecommitdiff
path: root/docs/user_guide/release_version_and_change_log.dox
diff options
context:
space:
mode:
authorJakub Sujak <jakub.sujak@arm.com>2023-02-10 14:36:48 +0000
committerJakub Sujak <jakub.sujak@arm.com>2023-02-10 17:35:29 +0000
commit9eefd4b371ecf32600ad8b3482cd7474f59106ac (patch)
tree848ba50b086899f9e156b0bfb82adb2e1921c8eb /docs/user_guide/release_version_and_change_log.dox
parentf4230aafcc106305d4da9798bd5211fe7a7809ea (diff)
downloadComputeLibrary-9eefd4b371ecf32600ad8b3482cd7474f59106ac.tar.gz
Update release version and change log documentation
Resolves: COMPMID-5565 Change-Id: I9dca679f57f6c3cc9489669b80a5da2aba500d34 Signed-off-by: Jakub Sujak <jakub.sujak@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9122 Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: SiCong Li <sicong.li@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com> Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs/user_guide/release_version_and_change_log.dox')
-rw-r--r--docs/user_guide/release_version_and_change_log.dox10
1 files changed, 6 insertions, 4 deletions
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox
index d658b5354f..8bb2a3f305 100644
--- a/docs/user_guide/release_version_and_change_log.dox
+++ b/docs/user_guide/release_version_and_change_log.dox
@@ -46,11 +46,8 @@ v23.02 Public major release
- Add the following operators to the experimental dynamic fusion API:
- GpuAdd, GpuCast, GpuClamp, GpuDepthwiseConv2d, GpuMul, GpuOutput, GpuPool2d, GpuReshape, GpuResize, GpuSoftmax, GpuSub.
- Add SME/SME2 kernels for GeMM, Winograd convolution, Depthwise convolution and Pooling.
+ - Add new CPU operator AddMulAdd for float and quantized types.
- Add new flag @ref ITensorInfo::lock_paddings() to tensors to prevent extending tensor paddings.
- - Add new OpenCL kernel to compute indirect convolution:
- - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink
- - Add new OpenCL kernel to compute transposed convolution:
- - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink
- Add experimental support for CPU only Bazel and CMake builds.
- Performance optimizations:
- Optimize CPU base-e exponential functions for FP32.
@@ -58,6 +55,11 @@ v23.02 Public major release
- Optimize CPU quantized Subtraction by reusing the quantized Addition kernel.
- Optimize CPU ReduceMean by removing quantization steps and performing the operation in integer domain.
- Optimize GPU Scale and Dynamic Fusion GpuResize by removing quantization steps and performing the operation in integer domain.
+ - Update the heuristic for CLDepthwiseConvolutionNative kernel.
+ - Add new optimized OpenCL kernel to compute indirect convolution:
+ - \link opencl::kernels::ClIndirectConv2dKernel ClIndirectConv2dKernel \endlink
+ - Add new optimized OpenCL kernel to compute transposed convolution:
+ - \link opencl::kernels::ClTransposedConvolutionKernel ClTransposedConvolutionKernel \endlink
- Update recommended/minimum NDK version to r20b.
- Various optimizations and bug fixes.