aboutsummaryrefslogtreecommitdiff
path: root/docs
diff options
context:
space:
mode:
authorSiCong Li <sicong.li@arm.com>2019-11-28 11:06:11 +0000
committerSiCong Li <sicong.li@arm.com>2019-11-28 12:20:24 +0000
commitca1f98c58dac53c94d09aa66de1c0a5fb85c9358 (patch)
tree1bfd130d77722545abec36e81e10b4a803254864 /docs
parent14cbfb2921990d8bf125231e350e2ac8dcd95a8b (diff)
downloadComputeLibrary-ca1f98c58dac53c94d09aa66de1c0a5fb85c9358.tar.gz
COMPMID-2911 Update README and Doxygen for 19.11
Signed-off-by: SiCong Li <sicong.li@arm.com> Change-Id: I4b8bc6970d8c0597a18115dc3414fdc9e05b91d9 Reviewed-on: https://review.mlplatform.org/c/2382 Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com> Tested-by: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'docs')
-rw-r--r--docs/00_introduction.dox57
-rw-r--r--docs/06_functions_list.dox13
2 files changed, 66 insertions, 4 deletions
diff --git a/docs/00_introduction.dox b/docs/00_introduction.dox
index 7f44fb7479..476023b43f 100644
--- a/docs/00_introduction.dox
+++ b/docs/00_introduction.dox
@@ -237,16 +237,69 @@ If there is more than one release in a month then an extra sequential number is
@subsection S2_2_changelog Changelog
v19.11 Public major release
- - Deprecated OpenCL kernels / functions
+ - Various bug fixes.
+ - Various optimisations.
+ - Deprecated OpenCL kernels / functions:
- CLDepthwiseConvolutionLayerReshapeWeightsGenericKernel
- CLDepthwiseIm2ColKernel
+ - CLDepthwiseSeparableConvolutionLayer
- CLDepthwiseVectorToTensorKernel
- CLDirectConvolutionLayerOutputStageKernel
- - Deprecated NEON kernels / functions
+ - Deprecated NEON kernels / functions:
- NEDepthwiseWeightsReshapeKernel
- NEDepthwiseIm2ColKernel
+ - NEDepthwiseSeparableConvolutionLayer
- NEDepthwiseVectorToTensorKernel
- NEDepthwiseConvolutionLayer3x3
+ - New OpenCL kernels / functions:
+ - @ref CLInstanceNormalizationLayerKernel / @ref CLInstanceNormalizationLayer
+ - @ref CLDepthwiseConvolutionLayerNativeKernel to replace the old generic depthwise convolution (see Deprecated
+ OpenCL kernels / functions)
+ - @ref CLLogSoftmaxLayer
+ - New NEON kernels / functions:
+ - @ref NEBoundingBoxTransformKernel / @ref NEBoundingBoxTransform
+ - @ref NEComputeAllAnchorsKernel / @ref NEComputeAllAnchors
+ - @ref NEDetectionPostProcessLayer
+ - @ref NEGenerateProposalsLayer
+ - @ref NEInstanceNormalizationLayerKernel / @ref NEInstanceNormalizationLayer
+ - @ref NELogSoftmaxLayer
+ - @ref NEROIAlignLayerKernel / @ref NEROIAlignLayer
+ - Added QASYMM8 support for:
+ - @ref CLGenerateProposalsLayer
+ - @ref CLROIAlignLayer
+ - @ref CPPBoxWithNonMaximaSuppressionLimit
+ - Added QASYMM16 support for:
+ - @ref CLBoundingBoxTransform
+ - Added FP16 support for:
+ - @ref CLGEMMMatrixMultiplyReshapedKernel
+ - Added new data type QASYMM8_PER_CHANNEL support for:
+ - @ref CLDequantizationLayer
+ - @ref NEDequantizationLayer
+ - Added new data type QSYMM8_PER_CHANNEL support for:
+ - @ref CLConvolutionLayer
+ - @ref NEConvolutionLayer
+ - @ref CLDepthwiseConvolutionLayer
+ - @ref NEDepthwiseConvolutionLayer
+ - Added FP16 mixed-precision support for:
+ - @ref CLGEMMMatrixMultiplyReshapedKernel
+ - @ref CLPoolingLayerKernel
+ - Added FP32 and FP16 ELU activation for:
+ - @ref CLActivationLayer
+ - @ref NEActivationLayer
+ - Added asymmetric padding support for:
+ - @ref CLDirectDeconvolutionLayer
+ - @ref CLGEMMDeconvolutionLayer
+ - @ref NEDeconvolutionLayer
+ - Added SYMMETRIC and REFLECT modes for @ref CLPadLayerKernel / @ref CLPadLayer.
+ - Replaced the calls to @ref NECopyKernel and @ref NEMemsetKernel with @ref NEPadLayer in @ref NEGenerateProposalsLayer.
+ - Replaced the calls to @ref CLCopyKernel and @ref CLMemsetKernel with @ref CLPadLayer in @ref CLGenerateProposalsLayer.
+ - Improved performance for CL Inception V3 - FP16.
+ - Improved accuracy for CL Inception V3 - FP16 by enabling FP32 accumulator (mixed-precision).
+ - Improved NEON performance by enabling fusing batch normalization with convolution and depth-wise convolution layer.
+ - Improved NEON performance for MobileNet-SSD by improving the output detection performance.
+ - Optimized @ref CLPadLayer.
+ - Optimized CL generic depthwise convolution layer by introducing @ref CLDepthwiseConvolutionLayerNativeKernel.
+ - Reduced memory consumption by implementing weights sharing.
v19.08 Public major release
- Various bug fixes.
diff --git a/docs/06_functions_list.dox b/docs/06_functions_list.dox
index 8143d827ae..30b522bb2a 100644
--- a/docs/06_functions_list.dox
+++ b/docs/06_functions_list.dox
@@ -36,8 +36,11 @@ namespace arm_compute
- @ref NEAbsoluteDifference
- @ref NEArithmeticAddition
- @ref NEArithmeticSubtraction
+ - @ref NEBoundingBoxTransform
- @ref NEBox3x3
- @ref NECast
+ - @ref NEComplexPixelWiseMultiplication
+ - @ref NEComputeAllAnchors
- @ref NEConvolution3x3
- @ref NEConvolutionRectangle
- @ref NEDilate
@@ -57,6 +60,8 @@ namespace arm_compute
- @ref NEPixelWiseMultiplication
- @ref NEPReluLayer
- @ref NERemap
+ - @ref NEROIAlignLayer
+ - @ref NERoundLayer
- @ref NERsqrtLayer
- @ref NEScharr3x3
- @ref NESelect
@@ -120,6 +125,7 @@ namespace arm_compute
- @ref NEDepthwiseConvolutionLayerOptimized
- @ref NEDequantizationLayer
- @ref NEDerivative
+ - @ref NEDetectionPostProcessLayer
- @ref NEDirectConvolutionLayer
- @ref NEEqualizeHistogram
- @ref NEFastCorners
@@ -139,12 +145,14 @@ namespace arm_compute
- @ref NEGEMMConvolutionLayer
- @ref NEGEMMLowpAssemblyMatrixMultiplyCore
- @ref NEGEMMLowpMatrixMultiplyCore
+ - @ref NEGenerateProposalsLayer
- @ref NEHarrisCorners
- @ref NEHistogram
- @ref NEHOGDescriptor
- @ref NEHOGGradient
- @ref NEHOGMultiDetection
- @ref NEIm2Col
+ - @ref NEInstanceNormalizationLayer
- @ref NEL2NormalizeLayer
- @ref NELaplacianPyramid
- @ref NELaplacianReconstruct
@@ -167,7 +175,7 @@ namespace arm_compute
- @ref NESimpleAssemblyFunction
- @ref NESobel5x5
- @ref NESobel7x7
- - @ref NESoftmaxLayer
+ - @ref NESoftmaxLayerGeneric &lt;IS_LOG&gt;
- @ref NESpaceToBatchLayer
- @ref NESpaceToDepthLayer
- @ref NESplit
@@ -237,7 +245,7 @@ namespace arm_compute
- @ref CLRNNLayer
- @ref CLSobel5x5
- @ref CLSobel7x7
- - @ref CLSoftmaxLayer
+ - @ref CLSoftmaxLayerGeneric &lt;IS_LOG&gt;
- @ref CLSpaceToBatchLayer
- @ref CLSpaceToDepthLayer
- @ref CLSplit
@@ -358,6 +366,7 @@ namespace arm_compute
- @ref IFunction
- @ref CPPDetectionOutputLayer
+ - @ref CPPDetectionPostProcessLayer
- @ref ICPPSimpleFunction
- @ref CPPBoxWithNonMaximaSuppressionLimit
- @ref CPPPermute