armnn.git -

Age	Commit message (Collapse)	Author
2020-01-23	IVGCVSW-4156 Remove backend dependency on ProfilingService.hppv19.11.1 branches/armnn_19_11	Jim Flynn
	Change-Id: I3b18ede85408bdfbc5147396e183e87bdabd3754 Signed-off-by: Jim Flynn <jim.flynn@arm.com>
2020-01-23	IVGCVSW-4371 Fixed backend compile IsReshapeSupported failure	Jim Flynn
	Change-Id: I71617cc35620bc5ff4e36797e41a58f5f959c07d Signed-off-by: Jim Flynn <jim.flynn@arm.com>
2020-01-22	IVGCVSW-4358 Update version number to 19.11.01	Teresa Charlin
	Signed-off-by: Teresa Charlin <teresa.charlinreyes@arm.com> Change-Id: I2b5a935fd5b5a82efaa50f546f10c502854e2657
2019-12-19	IVGCVSW-4302 Depthwise CTS. Fix ReorderWeightChannelsForAcl.	Teresa Charlin
	Signed-off-by: Teresa Charlin <teresa.charlinreyes@arm.com> Change-Id: I8d2050f4478fe9d9cdf9e374b8906827cc769689
2019-12-18	IVGCVSW-4268 Print all Descriptors on dot graph	Jim Flynn
	Change-Id: Ibc174f244bc72ff928879b0ea32f4f2c51f3a3b7 Signed-off-by: Jim Flynn <jim.flynn@arm.com> Signed-off-by: Kevin May <kevin.may@arm.com>
2019-12-18	MLCE-143 Fixed driver crash during CTS tests	Mike Kelly
	* Only apply the Optimization when the base ReshapeLayer is connected to the child ReshapeLayer and no other Layer. Signed-off-by: Mike Kelly <mike.kelly@arm.com> Change-Id: Iccd676d657f9e7c829813f1bec9c82db8745d069
2019-12-18	IVGCVSW-4293 Fix multiple Concat issues.	Derek Lamberti
	* Fix issue with InputLayer or ConstantLayer being used as inputs to Concat. * Fix issue with same input being used multiple times for same Concat. * Fix issue where input is used by multiple concats. Change-Id: Id4819aeec5a40e2afa0351838ba082b9f74aba33 Signed-off-by: Derek Lamberti <derek.lamberti@arm.com>
2019-12-17	IVGCVSW-4262 Use ACL Permute and Reshape Validate function in Neon and CL	Kevin May
	!android-nn-driver:2487 Signed-off-by: Kevin May <kevin.may@arm.com> Change-Id: Ibabb73c0ae0df2e530a68398f75c76e6b80c0701
2019-12-17	IVGCVSW-4267 Add missing layers to GetLayerTypeAsCString	Jim Flynn
	Change-Id: I5fc8804bb8d57077fd80dcc71e8e89d52d27b35b Signed-off-by: Jim Flynn <jim.flynn@arm.com>
2019-12-03	IVGCVSW-4206 Correctly pass execute network parameter.v19.11	Derek Lamberti
	Change-Id: I595b89dcb6419f7cb73cee51705b90f6eec7088b Signed-off-by: Derek Lamberti <derek.lamberti@arm.com>
2019-12-03	IVGCVSW-4206 Optionally parse unsupported ops in ExecuteNetwork	Derek Lamberti
	Change-Id: I593e2540bd870d70aabb2c959f4e63a899967269 Signed-off-by: Derek Lamberti <derek.lamberti@arm.com>
2019-11-27	IVGCVSW-4170 Also convert constants to FP16 when model converted	Derek Lamberti
	Signed-off-by: Derek Lamberti <derek.lamberti@arm.com> Signed-off-by: Kevin May <kevin.may@arm.com> Change-Id: I6318a70bd6818db55401bc667c116596e4909673
2019-11-23	IVGCVSW-4158 FP16 Mobilenet V1 and V2 30% regression on ArmNN on Mate20	Sadik Armagan
	* Enable FP16 mixed precision for Android Q Signed-off-by: Sadik Armagan <sadik.armagan@arm.com> Change-Id: I5ddb94b13385e1fec39e4407dffc8e4bc6b8d64a
2019-11-22	Github #251 Surround local structs with anonymous namespace	Matthew Bentham
	This fixes a one-definition-rule violation Change-Id: I0941ed21a04876009546b9b73f5fdfbf73c4110d Signed-off-by: Matthew Bentham <Matthew.Bentham@arm.com>
2019-11-21	IVGCVSW-4148 Report quant multiplier > 1 as unsupported for ACL	James Conroy
	* This is a temporary measure that needs to be removed when quantization multiplier > 1.0f support has been added for NEON and CL. * Layers affected: convolution, depthwise convolution, dilated depthwise convolution and transpose convolution. Signed-off-by: James Conroy <james.conroy@arm.com> Change-Id: Ief1aec2ff0eedf8250f6a8675288e1c343dcfce4
2019-11-21	IVGCVSW-4124 Replacing the "sleep_for" loop from FileOnlyProfilingConnection	Colm Donelan
	* Replacing the "sleep_for" loop in FileOnlyProfilingConnection with a producer consumer conditional mutex. * Reducing the times sleep loop times in FileOnlyProfilingDecoratorTests. Signed-off-by: Colm Donelan <Colm.Donelan@arm.com> Change-Id: Ied2302b508b6e4e6b50809c77e3f19115449d0b6
2019-11-20	IVGCVSW-4151 HAL 1_2 Dequantize FP32 Per Channel Tests on CpuAcc Failing	Sadik Armagan
	* Added support for data types QuantisedSymm8 and QuantizedSymm8PerAxis as they are supported on CpuAcc Signed-off-by: Sadik Armagan <sadik.armagan@arm.com> Change-Id: I55f81b35c8869bc37b7634bdbe91b8e3339eb648
2019-11-19	IVGCVSW-4070 Add CreatedNamedTypeEntity and CreateNamedTypedChildEntity	Narumol Prangnawarat
	functions with Guid Signed-off-by: Narumol Prangnawarat <narumol.prangnawarat@arm.com> Change-Id: Ide3c3b0a05830af055b3a2c733af4c1c57c0dbaa
2019-11-19	Revert "Only enable mixed precision FP16 pooling for Android Q"	Kevin May
	This reverts commit 60538ada2b90704abcf6473144639103d80287a5. Change-Id: I099e397fe1232e0f470d89a11d220752543e4e4c
2019-11-19	IVGCVSW-1530 Add TfLite slice parser and fix transpose perm vector creation	josh minor
	* TfLite slice parser and relevant tests added * TfLite transpose parser logic added to translate Tf/np permutation vector definitions to Armnn definitions * TfLite transpose parser no permute data test modified to include data for default permutation vector when none specified Signed-off-by: josh minor <josh.minor@arm.com> Change-Id: Iebd30971bd180593dc6b8f0d5be1d1bc61a3a5bf
2019-11-19	IVGCVSW-3697 Add check for ArgMinMax QAsymm8 to ClLayerSupport	Francis Murtagh
	* Enable Neon EndToEnd tests for ArgMinMax QAsymm8 * Enable Neon Layer tests for ArgMinMax QAsymm8 Signed-off-by: Francis Murtagh <francis.murtagh@arm.com> Change-Id: Ifa7463ded4397cacb82fb3667006f08ecbe3cd32
2019-11-19	IVGCVSW-4077 Fix issue when NEON import disabled	James Conroy
	* Removes workaround which handled null dstFactory when NEON import was disabled, and now handles this in the correct way. Signed-off-by: James Conroy <james.conroy@arm.com> Change-Id: Ief42b3c52d018f0fa71be4d4d37516f2caad1e0d
2019-11-19	IVGCVSW-4068 Add Guid to Workload	Narumol Prangnawarat
	* Add Guid to Workload * Remove circular dependency Signed-off-by: Narumol Prangnawarat <narumol.prangnawarat@arm.com> Signed-off-by: janeil01 <jan.eilers@arm.com> Change-Id: I15342fa7481c6bdc050e057dce2d74bba07fe2dd
2019-11-19	IVGCVSW-3729 Added neon slice workload and supporting neon layer tests	josh minor
	* Support added for ACL neon slice workload * Utility function created to translate ArmNN slice layer params to ACL neon slice layer equivalent * Neon slice layer tests added as per SliceTestImpl.hpp Signed-off-by: josh minor <josh.minor@arm.com> Change-Id: Id583465311879af139e8e977f16ed2280c937ac7
2019-11-19	MLCE-144 Cts NNAPI test cases failed	Mike Kelly
	* Fixed numerous CTS/VTS failures related to Quantization Signed-off-by: Mike Kelly <mike.kelly@arm.com> Change-Id: If5c20256366e80b6b9bbc46b2a1c410a9b8c48e1
2019-11-18	IVGCVSW-4116 Update ACL pin to the 19.11 release branch	Kevin May
	Signed-off-by: Kevin May <kevin.may@arm.com> Change-Id: I89a920b53b6f32ff37b7a4537e7ad38eed1ff608
2019-11-18	IVGCVSW-4062 Update Readme for 19.11	Nikhil Raj
	* Update BuildGuideCrossCompilation.md Signed-off-by: Nikhil Raj <nikhil.raj@arm.com> Change-Id: Ie16dfe477271e411eef0b5e68f636b81a61d5c33
2019-11-18	IVGCVSW-3980 Implementation of Guid generator	Narumol Prangnawarat
	* Improve implementation of Guid Generator to separate the range of Static Guid and Dynamic Guid * Unit tests to ensure non-collision Signed-off-by: Narumol Prangnawarat <narumol.prangnawarat@arm.com> Change-Id: I4ad1a75ea0b1f37155da0decafb51fc5a61e4187
2019-11-18	Fix quantizer crash by zero tensor	Tee Jung
	Signed-off-by: Jung Tae-young <tee.ty.jung@openedges.com> Signed-off-by: Matteo Martincigh <matteo.martincigh@arm.com> Change-Id: I1f0dfa4ca76e1c85a2b8fb5de12039a260224951
2019-11-18	IVGCVSW-4117 Update version number to 19.11	Aron Virginas-Tar
	Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: If56648c1d51014b872e81ff4239060ccbf55e8dc
2019-11-15	Fix possible crash in case of zero dimension tensor in the ONNX	Tee Jung
	parser Signed-off-by: Jung Tae-young <tee.ty.jung@openedges.com> Signed-off-by: Matteo Martincigh <matteo.martincigh@arm.com> Change-Id: I396792d4d59172cccb50d77de7e6b74977b289ed
2019-11-15	IVGCVSW-3486 Add clipping parameter validation in LstmQueueDescriptor	janeil01
	* Add clipping parameter validation in LstmQueueDescriptor * Related UnitTest Signed-off-by: janeil01 <jan.eilers@arm.com> Change-Id: I86ff81cacc0e1fff5b78a8d6c2dcbf9ff57e2272
2019-11-15	IVGCVSW-4129 Fix thread starvation due to low capture periods	Colm Donelan
	* Set default capture period to 10mSec. * Validate capture period in PeriodicCounterSelectionCommandHandler pull it up to 10mSec if it is lower. * Fix segmentation fault in GatordMock when receive thread closes. Signed-off-by: Colm Donelan <Colm.Donelan@arm.com> Change-Id: I9f7ddc70bd99c102c5baef872d28329976a4dc07
2019-11-15	IVGCVSW-4074 Send Timeline message in RequestCounterDirectoryCommandHandler	Matteo Martincigh
	* Added call to SendTimelineMessageDirectoryPackage in the handler * Updated the unit tests accordingly * Refactored SendTimelinePacket to remove macro Signed-off-by: Matteo Martincigh <matteo.martincigh@arm.com> Change-Id: I7bb6f8575945b99a0e77ef30ecfe4dee3058669e
2019-11-15	IVGCVSW-4119 Fix FP16 to FP32 fallback mechanism in optimizer to work with ↵	Aron Virginas-Tar
	Dequantize * Check for output data type as well as input data type when determining whether we should attempt to fall back to FP32 if FP16 is not supported * Override output type for Dequantize in IsLayerSupported() instead of input type * Updated original input type from FP16 to FP32 in InsertConvertFp32ToFp16LayersAfter() Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: Ic6477fd17cea5a91bd8bf9ae0cf836520897d5b7
2019-11-15	IVGCVSW-4140 Report per-axis quantization as unsupported for ↵	Aron Virginas-Tar
	DepthwiseConvolution on ACL backends * This is a temporary measure that needs to be removed as soon as the NEON and CL DepthwiseConvolution workloads will have added support for per-axis quantization Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: I24eb285230293392a6ed50aece1101e5aed7f90e
2019-11-15	IVGCVSW-4073 Send stream info in the ConnectionAcknowledgedCommandHandler	Matteo Martincigh
	* Added call to ISendTimelinePacket::SendStreamMetaDataPacket * Added call to ISendTimelinePacket::SendTimelineMessageDirectoryPackage * Added new StreamMetadataCommandHandler class to the mock Gatord service * Updated code and unit tests * Added include paths to the gatord mock target Signed-off-by: Matteo Martincigh <matteo.martincigh@arm.com> Change-Id: Ic6d200b513175884607b7c0563cbfa4942ff2fc6
2019-11-15	IVGCVSW-4072 Add stream header to Timeline Message Directory packet	Matteo Martincigh
	* Refactored the WriteTimelineMessageDirectoryPacket function * Added the stream header to the packet * Updated decoders/parsers * Updated unit tests accordingly * Minor refactoring Signed-off-by: Matteo Martincigh <matteo.martincigh@arm.com> Change-Id: I58f15fde54adc6414ca9fd5fb8d6157cad867339
2019-11-15	NNXSW-1853 Change SubgraphViewSelector algorithm	Rob Hughes
	The current algorithm in SubgraphViewSelector has a bug that can lead to it producing subgraphs which have a dependency cycle (see the newly added test case 'ValidMerge' for a repro). It also fails to merge subgraphs in some cases where it could, which leads to smaller subgraphs. In the case of FSRCNN, the NPU cannot support these smaller subgraphs and so this is blocking us from supporting that network. This commit changes the algorithm to fix the dependency bug and also make it so that subgraphs are merged in the cases that were missed before. It also adds some unit tests to cover cases that were problematic before, and to extend coverage for the new algorithm. The new algorithm has two downsides compared to the previous one: 1. Disjoint subgraphs are not merged. This can never lead to a failed compilation by the NPU and so I believe this is less of an issue than the previous algorithm's "missed merges". This could however lead to a runtime performance loss in some cases as the NPU will be unable to parallelise as many operations. There are some unit tests that cover this which I have disabled. 2. The performance is worse. I have spent some time analysing this and for a graph with ~1000 layers the new algorithm takes 20ms vs. the old algorithm's 4ms (on my desktop PC). I believe the performance is still within acceptable limits. I also compared inception V3 (which was the network which caused performance issues with the original version of the splitting algorithm) and this new algorithm has not regressed there (200-300us in both cases). Change-Id: I1dd64a779f272723621e04d203b5a2752a6af2ef Signed-off-by: Robert Hughes <robert.hughes@arm.com>
2019-11-15	Print CMake messages on stdout rather than stderr	Rob Hughes
	The default version of message("...") print to stderr, which is inappropriate for informational messages such as the ones we are printing in these cases. Using message(STATUS "...") makes these messages appear on stdout instead which is more appropriate. Change-Id: I02f41e6b4948e6938566f06d7164444bd5b8199e Signed-off-by: Robert Hughes <robert.hughes@arm.com>
2019-11-15	Add FP16 support to DebugWorkload	Aron Virginas-Tar
	Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: Ia879f2d84a1b977474ee0dafa976f2aab32bd3ae
2019-11-15	Only enable mixed precision FP16 pooling for Android Q	Derek Lamberti
	Change-Id: Ic2c0ce7a7a99bbc430b7d6da272825540772e01d Signed-off-by: Derek Lamberti <derek.lamberti@arm.com>
2019-11-14	Fix redundancy in call to configure() in ACL DepthwiseConvolution workloads	Aron Virginas-Tar
	Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: I8f698c6ec9826ce1188bc43bd59fbf7b83455c1a
2019-11-14	Add SpaceToDepth to GetLayerTypeAsCString()	Aron Virginas-Tar
	Signed-off-by: Aron Virginas-Tar <Aron.Virginas-Tar@arm.com> Change-Id: I263c78e02238fa7c7f9ab6408fb197664e5fe048
2019-11-14	CL & Neon workload factories inherit from WorkloadFactoryBase	Derek Lamberti
	Change-Id: I1f694be7ef1d333b5ef9b60ea7029454ade02628 Signed-off-by: Derek Lamberti <derek.lamberti@arm.com>
2019-11-14	Fix link error due to pthread being linked in the wrong order	Rob Hughes
	Change-Id: I9602c758fe462b65d67de491d91fb2392b09b8bd Signed-off-by: Robert Hughes <robert.hughes@arm.com>
2019-11-14	Fix a few compile errors:	Rob Hughes
	* Replace use of non-standard integral types (e.g. u_char) * Convert boost::filesystem::paths to std::strings using the .string() method rather than .c_str(), because on Windows .c_str() returns a wide character string, which is not convertible to a std::string. Change-Id: Ia86b0653697033bb1afa01e64b5b2103dd042ffd Signed-off-by: Robert Hughes <robert.hughes@arm.com>
2019-11-13	IVGCVSW-3697 Add utility function to get ArgMinMaxFunction as string	Francis Murtagh
	* Allow logging of ConvertArgMinMax calls with specified Min/Max function which take place in HalPolicy Signed-off-by: Francis Murtagh <francis.murtagh@arm.com> Change-Id: Ic8c38106725023c864f7950dc9d0e2737485cfef
2019-11-13	IVGCVSW-4053 Enable ArgMinMax EndToEndTest for NEON/CL	James Conroy
	* Enabled for Float32 only, as per support in ACL. Signed-off-by: James Conroy <james.conroy@arm.com> Change-Id: I251fc832e3058d389ee9bef96856baff89ba6f9a
2019-11-13	IVGCVSW-4128 Add Signed32 to supported input types for Ref ArgMinMax	Francis Murtagh
	* Enabled RefLayerTests for Signed32 Signed-off-by: Francis Murtagh <francis.murtagh@arm.com> Change-Id: Idbe6fb7607c7e44a8df560b55f28c64a4c4286cd