From 0f7ef8ab2171093855a8f21bd39c8fd7066dd629 Mon Sep 17 00:00:00 2001 From: Georgios Pinitas Date: Sun, 10 Jan 2021 04:23:52 +0000 Subject: Make memset/copy functions state-less Port following functions: - NECopy - NEFill - NEPermute - NEReshapeLayer Signed-off-by: Georgios Pinitas Change-Id: I75f3f837012abab79c7dde9a20a34f64f75571d8 Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/4800 Tested-by: Arm Jenkins Reviewed-by: Michele Di Giorgio Comments-Addressed: Arm Jenkins --- docs/00_introduction.dox | 10 +++++----- 1 file changed, 5 insertions(+), 5 deletions(-) (limited to 'docs/00_introduction.dox') diff --git a/docs/00_introduction.dox b/docs/00_introduction.dox index b8bdabf534..5e8769c366 100644 --- a/docs/00_introduction.dox +++ b/docs/00_introduction.dox @@ -659,8 +659,8 @@ v19.11 Public major release - @ref CLGEMMDeconvolutionLayer - @ref NEDeconvolutionLayer - Added SYMMETRIC and REFLECT modes for @ref CLPadLayerKernel / @ref CLPadLayer. - - Replaced the calls to @ref NECopyKernel and @ref NEMemsetKernel with @ref NEPadLayer in @ref NEGenerateProposalsLayer. - - Replaced the calls to @ref CLCopyKernel and @ref CLMemsetKernel with @ref CLPadLayer in @ref CLGenerateProposalsLayer. + - Replaced the calls to NECopyKernel and NEMemsetKernel with @ref NEPadLayer in @ref NEGenerateProposalsLayer. + - Replaced the calls to CLCopyKernel and CLMemsetKernel with @ref CLPadLayer in @ref CLGenerateProposalsLayer. - Improved performance for CL Inception V3 - FP16. - Improved accuracy for CL Inception V3 - FP16 by enabling FP32 accumulator (mixed-precision). - Improved NEON performance by enabling fusing batch normalization with convolution and depth-wise convolution layer. @@ -828,7 +828,7 @@ v19.02 Public major release - @ref NEStackLayerKernel / @ref NEStackLayer - @ref NERangeKernel / @ref NERange - @ref NEPadLayer - - @ref NEMemsetKernel + - NEMemsetKernel - @ref NEGatherKernel / @ref NEGather - @ref NEElementwiseComparison - @ref NEElementwiseComparisonStatic @@ -1070,7 +1070,7 @@ v18.02 Public major release - New NEON kernels / functions - Added name() method to all kernels. - Added support for Winograd 5x5. - - @ref NEPermuteKernel / @ref NEPermute + - NEPermuteKernel / @ref NEPermute - @ref NEWinogradLayerTransformInputKernel / NEWinogradLayer - @ref NEWinogradLayerTransformOutputKernel / NEWinogradLayer - @ref NEWinogradLayerTransformWeightsKernel / NEWinogradLayer @@ -1183,7 +1183,7 @@ v17.09 Public major release - @ref NEQuantizationLayerKernel @ref NEMinMaxLayerKernel / @ref NEQuantizationLayer - @ref NEROIPoolingLayerKernel / @ref NEROIPoolingLayer - @ref NEReductionOperationKernel / @ref NEReductionOperation - - @ref NEReshapeLayerKernel / @ref NEReshapeLayer + - NEReshapeLayerKernel / @ref NEReshapeLayer - New OpenCL kernels / functions: - @ref CLDepthwiseConvolutionLayer3x3NCHWKernel @ref CLDepthwiseConvolutionLayer3x3NHWCKernel CLDepthwiseIm2ColKernel CLDepthwiseVectorToTensorKernel CLDepthwiseWeightsReshapeKernel / CLDepthwiseConvolutionLayer3x3 @ref CLDepthwiseConvolutionLayer CLDepthwiseSeparableConvolutionLayer -- cgit v1.2.1