diff options
author | SiCong Li <sicong.li@arm.com> | 2019-11-28 11:06:11 +0000 |
---|---|---|
committer | SiCong Li <sicong.li@arm.com> | 2019-11-28 12:20:24 +0000 |
commit | ca1f98c58dac53c94d09aa66de1c0a5fb85c9358 (patch) | |
tree | 1bfd130d77722545abec36e81e10b4a803254864 /docs/00_introduction.dox | |
parent | 14cbfb2921990d8bf125231e350e2ac8dcd95a8b (diff) | |
download | ComputeLibrary-ca1f98c58dac53c94d09aa66de1c0a5fb85c9358.tar.gz |
COMPMID-2911 Update README and Doxygen for 19.11
Signed-off-by: SiCong Li <sicong.li@arm.com>
Change-Id: I4b8bc6970d8c0597a18115dc3414fdc9e05b91d9
Reviewed-on: https://review.mlplatform.org/c/2382
Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs/00_introduction.dox')
-rw-r--r-- | docs/00_introduction.dox | 57 |
1 files changed, 55 insertions, 2 deletions
diff --git a/docs/00_introduction.dox b/docs/00_introduction.dox index 7f44fb7479..476023b43f 100644 --- a/docs/00_introduction.dox +++ b/docs/00_introduction.dox @@ -237,16 +237,69 @@ If there is more than one release in a month then an extra sequential number is @subsection S2_2_changelog Changelog v19.11 Public major release - - Deprecated OpenCL kernels / functions + - Various bug fixes. + - Various optimisations. + - Deprecated OpenCL kernels / functions: - CLDepthwiseConvolutionLayerReshapeWeightsGenericKernel - CLDepthwiseIm2ColKernel + - CLDepthwiseSeparableConvolutionLayer - CLDepthwiseVectorToTensorKernel - CLDirectConvolutionLayerOutputStageKernel - - Deprecated NEON kernels / functions + - Deprecated NEON kernels / functions: - NEDepthwiseWeightsReshapeKernel - NEDepthwiseIm2ColKernel + - NEDepthwiseSeparableConvolutionLayer - NEDepthwiseVectorToTensorKernel - NEDepthwiseConvolutionLayer3x3 + - New OpenCL kernels / functions: + - @ref CLInstanceNormalizationLayerKernel / @ref CLInstanceNormalizationLayer + - @ref CLDepthwiseConvolutionLayerNativeKernel to replace the old generic depthwise convolution (see Deprecated + OpenCL kernels / functions) + - @ref CLLogSoftmaxLayer + - New NEON kernels / functions: + - @ref NEBoundingBoxTransformKernel / @ref NEBoundingBoxTransform + - @ref NEComputeAllAnchorsKernel / @ref NEComputeAllAnchors + - @ref NEDetectionPostProcessLayer + - @ref NEGenerateProposalsLayer + - @ref NEInstanceNormalizationLayerKernel / @ref NEInstanceNormalizationLayer + - @ref NELogSoftmaxLayer + - @ref NEROIAlignLayerKernel / @ref NEROIAlignLayer + - Added QASYMM8 support for: + - @ref CLGenerateProposalsLayer + - @ref CLROIAlignLayer + - @ref CPPBoxWithNonMaximaSuppressionLimit + - Added QASYMM16 support for: + - @ref CLBoundingBoxTransform + - Added FP16 support for: + - @ref CLGEMMMatrixMultiplyReshapedKernel + - Added new data type QASYMM8_PER_CHANNEL support for: + - @ref CLDequantizationLayer + - @ref NEDequantizationLayer + - Added new data type QSYMM8_PER_CHANNEL support for: + - @ref CLConvolutionLayer + - @ref NEConvolutionLayer + - @ref CLDepthwiseConvolutionLayer + - @ref NEDepthwiseConvolutionLayer + - Added FP16 mixed-precision support for: + - @ref CLGEMMMatrixMultiplyReshapedKernel + - @ref CLPoolingLayerKernel + - Added FP32 and FP16 ELU activation for: + - @ref CLActivationLayer + - @ref NEActivationLayer + - Added asymmetric padding support for: + - @ref CLDirectDeconvolutionLayer + - @ref CLGEMMDeconvolutionLayer + - @ref NEDeconvolutionLayer + - Added SYMMETRIC and REFLECT modes for @ref CLPadLayerKernel / @ref CLPadLayer. + - Replaced the calls to @ref NECopyKernel and @ref NEMemsetKernel with @ref NEPadLayer in @ref NEGenerateProposalsLayer. + - Replaced the calls to @ref CLCopyKernel and @ref CLMemsetKernel with @ref CLPadLayer in @ref CLGenerateProposalsLayer. + - Improved performance for CL Inception V3 - FP16. + - Improved accuracy for CL Inception V3 - FP16 by enabling FP32 accumulator (mixed-precision). + - Improved NEON performance by enabling fusing batch normalization with convolution and depth-wise convolution layer. + - Improved NEON performance for MobileNet-SSD by improving the output detection performance. + - Optimized @ref CLPadLayer. + - Optimized CL generic depthwise convolution layer by introducing @ref CLDepthwiseConvolutionLayerNativeKernel. + - Reduced memory consumption by implementing weights sharing. v19.08 Public major release - Various bug fixes. |