author | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2020-08-12 10:25:29 +0100 |
---|---|---|
committer | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2020-08-13 14:31:08 +0000 |
commit | 547b2e7aa07db4dd41f99e492c40710f2548c6ba (patch) | |
tree | 98d2abf40356901923e07ef6ecf673d7f22077ed /docs | |
parent | d72bd12f63601d873bd2a567973950333e0076d2 (diff) | |
download | ComputeLibrary-547b2e7aa07db4dd41f99e492c40710f2548c6ba.tar.gz |
COMPMID-3702: Update documentation
- Update documentation about remove padding in GEMM - OpenCL
- Update documentation about the OpenCL image object support in GEMM
Change-Id: I015193ee5c5b946cf053968eeeacc042b33b6f6e
Signed-off-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/3728
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Reviewed-by: Sheri Zhang <sheri.zhang@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs')
-rw-r--r-- | docs/00_introduction.dox | 8 |
1 files changed, 8 insertions, 0 deletions
diff --git a/docs/00_introduction.dox b/docs/00_introduction.dox
index 90064399e7..c281cb21b7 100644
--- a/docs/00_introduction.dox
+++ b/docs/00_introduction.dox
@@ -270,6 +270,14 @@ v20.08 Public major release
    The default "axis" value for @ref NESoftmaxLayer, @ref NELogSoftmaxLayer is changed from 1 to 0.
    Only axis 0 is supported.
  - The support for quantized data types has been removed from @ref CLLogSoftmaxLayer due to implementation complexity.
+ - Removed padding requirement for the input (e.g. LHS of GEMM) and output in @ref CLGEMMMatrixMultiplyNativeKernel, @ref CLGEMMMatrixMultiplyReshapedKernel, @ref CLGEMMMatrixMultiplyReshapedOnlyRHSKernel and @ref CLIm2ColKernel (NHWC only)
+   - This change allows to use @ref CLGEMMConvolutionLayer without extra padding for the input and output.
+   - Only the weights/bias of @ref CLGEMMConvolutionLayer could require padding for the computation.
+   - Only on Arm Mali Midgard GPUs, @ref CLGEMMConvolutionLayer could require padding since @ref CLGEMMMatrixMultiplyKernel is called and currently requires padding.
+ - Added support for exporting the OpenCL buffer object to the OpenCL image object in @ref CLGEMMMatrixMultiplyReshapedKernel and @ref CLGEMMMatrixMultiplyReshapedOnlyRHSKernel.
+   - This support allows to export the OpenCL buffer used for the reshaped RHS matrix to the OpenCL image object.
+   - The padding requirement for the OpenCL image object is considered into the @ref CLGEMMReshapeRHSMatrixKernel.
+   - The reshaped RHS matrix stores the weights when GEMM is used to accelerate @ref CLGEMMConvolutionLayer.
 
 v20.05 Public major release
  - Various bug fixes.
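
The changelog entry says the padding requirement for the OpenCL image object is handled in CLGEMMReshapeRHSMatrixKernel, but the commit does not show how. As a rough illustration only: a 2D image created from a buffer needs a row pitch that is a multiple of the device's `CL_DEVICE_IMAGE_PITCH_ALIGNMENT`, which is expressed in pixels. The sketch below computes the per-row padding that follows from that rule. The helper names `round_up` and `row_padding_in_pixels` are hypothetical and are not part of Compute Library. The alignment value of 64 is an assumed example, not a value taken from any particular GPU.

```cpp
#include <cstddef>

// Hypothetical helper (not a Compute Library API):
// round `value` up to the next multiple of `alignment` (alignment must be > 0).
constexpr std::size_t round_up(std::size_t value, std::size_t alignment)
{
    return ((value + alignment - 1) / alignment) * alignment;
}

// Hypothetical helper (not a Compute Library API):
// number of padding pixels to append to each row of the buffer so that the
// row pitch becomes a multiple of the pitch alignment, both in pixels.
constexpr std::size_t row_padding_in_pixels(std::size_t row_width_px,
                                            std::size_t pitch_alignment_px)
{
    return round_up(row_width_px, pitch_alignment_px) - row_width_px;
}

// Assumed example values: a row of 100 pixels with a 64-pixel pitch alignment
// is padded to 128 pixels, so 28 padding pixels are needed.
static_assert(row_padding_in_pixels(100, 64) == 28, "100 px row padded to 128 px");
// A row that is already aligned needs no padding.
static_assert(row_padding_in_pixels(128, 64) == 0, "aligned row needs no padding");
```

In a real application the alignment would be queried from the device at run time rather than hard-coded. The kernel in the library may apply further constraints that this sketch does not model.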