From 407c102b62632185e06b5e75553f7b0cdbc46cdb Mon Sep 17 00:00:00 2001 From: SiCong Li Date: Mon, 25 Nov 2019 19:15:56 +0000 Subject: COMPMID-2911 Update README and Doxygen for 19.11 Change-Id: Ibf0a0b5dc30471efb9089a7154f0204f6568f531 Signed-off-by: SiCong Li Reviewed-on: https://review.mlplatform.org/c/2369 Reviewed-by: Georgios Pinitas Tested-by: Georgios Pinitas --- docs/00_introduction.dox | 57 ++++++++++++++++++++++++++++++++++++++++++++++-- 1 file changed, 55 insertions(+), 2 deletions(-) (limited to 'docs/00_introduction.dox') diff --git a/docs/00_introduction.dox b/docs/00_introduction.dox index 7f44fb7479..24e119c277 100644 --- a/docs/00_introduction.dox +++ b/docs/00_introduction.dox @@ -237,16 +237,69 @@ If there is more than one release in a month then an extra sequential number is @subsection S2_2_changelog Changelog v19.11 Public major release - - Deprecated OpenCL kernels / functions + - Various bug fixes. + - Various optimisations. + - Deprecated OpenCL kernels / functions: - CLDepthwiseConvolutionLayerReshapeWeightsGenericKernel - CLDepthwiseIm2ColKernel + - CLDepthwiseSeparableConvolutionLayer - CLDepthwiseVectorToTensorKernel - CLDirectConvolutionLayerOutputStageKernel - - Deprecated NEON kernels / functions + - Deprecated NEON kernels / functions: - NEDepthwiseWeightsReshapeKernel - NEDepthwiseIm2ColKernel + - NEDepthwiseSeparableConvolutionLayer - NEDepthwiseVectorToTensorKernel - NEDepthwiseConvolutionLayer3x3 + - New OpenCL kernels / functions: + - @ref CLInstanceNormalizationLayerKernel / @ref CLInstanceNormalizationLayer + - @ref CLDepthwiseConvolutionLayerNativeKernel to replace the old generic depthwise convolution (see Deprecated + OpenCL kernels / functions) + - @ref CLLogSoftmaxLayer + - New NEON kernels / functions: + - @ref NEBoundingBoxTransformKernel / @ref NEBoundingBoxTransform + - @ref NEComputeAllAnchorsKernel / @ref NEComputeAllAnchors + - @ref NEDetectionPostProcessLayer + - @ref NEGenerateProposalsLayer + - @ref NEInstanceNormalizationLayerKernel / @ref NEInstanceNormalizationLayer + - @ref NELogSoftmaxLayer + - @ref NEROIAlignLayerKernel / @ref NEROIAlignLayer + - Added QASYMM8 support for: + - @ref CLGenerateProposalsLayer + - @ref CLROIAlignLayer + - @ref CPPBoxWithNonMaximaSuppressionLimit + - Added QASYMM16 support for: + - @ref CLBoundingBoxTransform + - Added FP16 support for: + - @ref CLGEMMMatrixMultiplyReshapedKernel + - Added new data type QASYMM8_PER_CHANNEL support for: + - @ref CLDequantizationLayer + - @ref NEDequantizationLayer + - Added new data type QSYMM8_PER_CHANNEL support for: + - @ref CLConvolutionLayer + - @ref NEConvolutionLayer + - @ref CLDepthwiseConvolutionLayer + - @ref NEDepthwiseConvolutionLayer + - Added FP16 mixed-precision support for: + - @ref CLGEMMMatrixMultiplyReshapedKernel + - @ref CLPoolingLayerKernel + - Added FP32 and FP16 ELU activation for: + - @ref CLActivationLayer + - @ref NEActivationLayer + - Added asymmetric padding support for: + - @ref CLDirectDeconvolutionLayer + - @ref CLGEMMDeconvolutionLayer + - @ref NEDeconvolutionLayer + - Added SYMMETRIC and REFLECT modes for @ref CLPadLayerKernel / @ref CLPadLayer. + - Replaced the calls to @ref NECopyKernel and @ref NEMemsetKernel with @ref NEPadLayer in @ref NEGenerateProposals. + - Replaced the calls to @ref CLCopyKernel and @ref CLMemsetKernel with @ref CLPadLayer in @ref CLGenerateProposals. + - Improved performance for CL Inception V3 - FP16. + - Improved accuracy for CL Inception V3 - FP16 by enabling FP32 accumulator (mixed-precision). + - Improved NEON performance by enabling fusing batch normalization with convolution and depth-wise convolution layer. + - Improved NEON performance for MobileNet-SSD by improving the output detection performance. + - Optimized @ref CLPadLayer. + - Optimized CL generic depthwise convolution layer by introducing @ref CLDepthwiseConvolutionLayerNativeKernel. + - Reduced memory consumption by implementing weights sharing. v19.08 Public major release - Various bug fixes. -- cgit v1.2.1