diff options
author | Jakub Sujak <jakub.sujak@arm.com> | 2023-07-31 13:36:58 +0100 |
---|---|---|
committer | Jakub Sujak <jakub.sujak@arm.com> | 2023-08-07 08:44:17 +0000 |
commit | e1c96e7e6dbf5314676fc81831e2ccb34a031ea1 (patch) | |
tree | f69ee0ac5dd6b29de0041293b279804bcecf0df7 /docs/user_guide | |
parent | 78ce2730ecd2f1e666cdd10263bf054c0b740a9c (diff) | |
download | ComputeLibrary-e1c96e7e6dbf5314676fc81831e2ccb34a031ea1.tar.gz |
Port DirectConv2d to CKW backend
Ports the direct convolution 2D kernel from the experimental Dynamic Fusion interface to use the new Compute Kernel Writer backend for OpenCL code generation.
Support is for FP16/FP32 only.
Resolves: COMPMID-6259
Change-Id: Ia8d7b9cb789737b22b1d877cd798a73eda0ce4ab
Signed-off-by: Jakub Sujak <jakub.sujak@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/10059
Reviewed-by: Gunes Bayir <gunes.bayir@arm.com>
Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs/user_guide')
-rw-r--r-- | docs/user_guide/release_version_and_change_log.dox | 7 |
1 files changed, 7 insertions, 0 deletions
diff --git a/docs/user_guide/release_version_and_change_log.dox b/docs/user_guide/release_version_and_change_log.dox index 801f1f0b0f..edc0c3b9e7 100644 --- a/docs/user_guide/release_version_and_change_log.dox +++ b/docs/user_guide/release_version_and_change_log.dox @@ -48,6 +48,13 @@ v23.08 Public major release - @ref opencl::kernels::ClMatMulNativeMMULKernel support for FP32 and FP16, with batch support - Enable transposed convolution with non-square kernels on CPU and GPU. - Added support for input data type U64/S64 in CLCast. + - Added new Compute Kernel Writer (CKW) subproject that offers a C++ interface to generate tile-based OpenCL code in just-in-time fashion. + - Port the following kernels in the experimental Dynamic Fusion interface to use the new Compute Kernel Writer interface with support for FP16/FP32 only: + - @ref experimental::dynamic_fusion::GpuCkwActivation + - @ref experimental::dynamic_fusion::GpuCkwCast + - @ref experimental::dynamic_fusion::GpuCkwDirectConv2d + - @ref experimental::dynamic_fusion::GpuCkwElementwiseBinary + - @ref experimental::dynamic_fusion::GpuCkwStore - Various optimizations and bug fixes. v23.05.1 Public patch release |