Age | Commit message (Collapse) | Author |
|
* Refactored all the workload factories to use "virtual" properly
* Removed the precompiled layer support from the optimizer
* Disabled compiler optimization for debug build to ease debugging
Change-Id: I6ab83abd78f422771359295e25b79bb9be613cfd
|
|
Change-Id: I0ab3339e8803a3e4e700d8fec9883eccc524b31e
|
|
* add implemenentation for GatherQueueDescriptor validate function
* add FirstInputTypedWorkload to allow type check on the first input tensor only
* add ref workload implemenentation for float and uint8
* add Gather layer support in Ref
* unit tests
Change-Id: I4578a3211f11d24aa29d15bcf7f45b0445bcd1ee
|
|
Change-Id: I7359617a307b9abb4c30b3d5f2364dc6d0f828f0
|
|
* Added GatherQueueDescriptor to WorkloadData
* Added CreateGather function in WorkloadFactory.hpp
* Added stub implementation of the CreateGreater function in workload factories
* Added GatherLayer stub implementation
* Added AddGatherLayer to Network
* Added IsGatherSupported to LayerSupportBase
Change-Id: I0408fd54e88a7d4e3d9e1c2811a9323f0da52a04
|
|
separate branch) and master
* Brings in all the changes done for the pluggable backends
* Added sub-graph support and tests
* Added precompiled layer support and tests
* Moved BackendSettings to a separate file
* Removed the backend-specific code
* Ported DebugLayer and associated functionality
* Included fixes to make those changes work with master
Change-Id: Id7028fa7917527b844628d5aff5732e3d94c0488
|
|
whats done in the constructor.
Change-Id: I9488cebd63767c0c51d9ee6333d5aaf0e4a85847
|
|
code base
* Made the virtual functions in ILayerSupport.hpp pure
* Created a LayerSupportBase class with the default implementation of
the interface
* Made the backend layer support classes inherit from the base
class, instead of directly from the interface
* Refactored the profiler and the profiling event classes to use
the BackendId instead of the Compute
* Implemented a proper MemCopy support method
* Changed Compute to BackendId in the profiling API and objects
* Removed static references to pluggable backends
!android-nn-driver:492
Change-Id: Id6332b5f48c980819e0a09adc818d1effd057296
|
|
Change-Id: I43be451f490db0154021f47a2fd49d1269cf5b95
|
|
* Added CL unit tests
Change-Id: Ia4af49b133774141384a177f5b042e65764bf682
|
|
Change-Id: I0bc48f118aba419abb659657b83224401fac0715
|
|
Just include the function definition that is specifically needed for each workload.
Also, tighten up the scope where Compute Library functions are available.
Knocks about 30seconds off a 4m30s single-threaded compile of the Neon workloads.
Change-Id: Idac438f3bc77ff978295fbc9505cb42447def145
|
|
*Added Unit Tests
Change-Id: I6cceb8e6dcda35ce08415f8e5ca86019a64d26e3
|
|
in favor of a type-indepent implementation that takes void-pointers
* The new implementation requires the size of the type to be passed
to the function
* Updated all the usages accordingly
* Removed the old implementation no longer used
!android-nn-driver:469
Change-Id: I37f4e6d62a38fbb8ec8c39bb559a2c54c83365d4
|
|
* Unified ArmNN's weight format to [ M, I, H, W ] for the depthwise convolution
* Added conversion utilities to permute/reshape the weights as appropriate
when using CL and Neon backends
* Updated the reference implementation of the convolution
* Updated the relevant unit tests accordingly
!android-nn-driver:459
Change-Id: I07d0818efa9d1ca1e5dad82983aac1fe78eadb18
|
|
default cases are handled by DefaultLayerSupport
Change-Id: I31942a5e8515d6fdce9a0adf20906eaabc2b9aa6
|
|
StringMapping was located in the backendsCommon
Change-Id: I1c4a0bfca986722c721fcb2b460c03d6e1cd4568
|
|
Change-Id: Icf7ef88a2eaef80ec32cc718b0ca9d26e830ed07
|
|
Change-Id: Iae6a3bfad06cb669ef0abea465d03163bb8316f5
|
|
Factor out new BuildArmComputeReductionCoordinates function
from CL backend into ArmComputeTensorUtils.
Update NEON LayerSupport and WorkloadFactory objects
Change-Id: Icc975ec699199bffafbdb207323df509d35e1e04
|
|
Technical debt work towards adding some new Neon workloads
Change-Id: I08ab6dd14d0e89d4ebc8a878fb69caa5681012bf
|
|
Change-Id: Ibe1b27b268011878c7dce3c96efea01402453027
|
|
!android-nn-driver:428
Change-Id: I42e59ad96d2c80f46b085182855d34b710a74dfe
|
|
* Add Arithmetic end-to-end test implementation
* Enable tests for float, Uint8 and Broadcast
Change-Id: I81c7096e9b6ad29eaa935b74ad5f30f823be2331
|
|
Change-Id: Iddae3486641d2d195827ae4e0a9dfa3f7ccd65e3
|
|
* Added CLMaximumWorkload implementation
* Added CL unit tests
Change-Id: I922c83fe25d17be21a5d0f4e861038463cb09789
|
|
* Added the Greater operation as an element-wise workload
* Added the unit tests
Change-Id: Ie00ee30e47a5f5e17a728032eeb11a085d06c8f2
|
|
* Add reference equal workload
* Add Reference Workload Unit Test
Change-Id: If2848e7dde4248566b99d91726d08143c40ff80d
|
|
* Added ACl Implementation to ArmNN
* Added helper function for setting CL Strided Slice arguments
Change-Id: Ie10d387fdb054027ea9b8782743d270bb72949c1
|
|
Equal operation
* Add QueueDescriptor in WorkloadData.hpp
* Add CreateEqual function in WorkloadFactory.hpp
* Added stub implementation of the CreateEqual function in RefWorkloadFactory,
NeonWorkloadFactory and ClWorkloadFactory
Change-Id: Iec6dc2f989c67fa6f0c32cfb93508995c8580783
|
|
Change-Id: I56f56398e7c2c2ac57a96952237e7f1bc0ced208
|
|
per COMPMID-1818 change to using Blob-based memory manager.
Will change back when COMPMID-1848 is fixed.
Change-Id: I6e22c2edf422202410faa717b330a2c96c3ebbe5
|
|
Greater operation
* Added QueueDescriptor in WorkloadData.hpp
* Added CreateGreater function in WorkloadFactory.hpp
* Added stub implementation of the CreateGreater function in RefWorkloadFactory,
NeonWorkloadFactory and ClWorkloadFactory
* Added GreaterLayer stub implementation
* Renamed ArithmeticBaseLayer to ElementwiseBaseLayer
Change-Id: I7e38c2936de905da921a92ba3f918478169ec7f5
|
|
* Modified optimizer to support debug mode via DebugLayer
Change-Id: Ic8f313778e55540c182cf99876c44a0823be04c6
|
|
Kernel position within the layer is part of the kernel name
Change-Id: I3b9ee870fbd0db025f1515c81245f917189ae884
|
|
Change-Id: I0d69a72df9e348cbeca690926485c050499d1051
|
|
Unit tests not yet added as need Compute Library a84faffd.
Change-Id: Ica16df493e8d6a76da9d1f74bf43b8403f9dff62
|
|
NEFillBorderKernel may not be issued for some activation workloads
depending on the version of the Compute Library.
Change-Id: I01b8960f9eb7a0192cd351eddbb416ac05248b34
|
|
Change-Id: I87c53c78dd033871dbc543550a61d2d535e752ac
|
|
Change-Id: Ib2e5de2a057da57ef77a9f5c4367d699d4773294
|
|
Change-Id: I6b26c681989ef815752186f027a245ef8da7aea2
|
|
* Removed the input swizzling when the data layout is NHWC
* Permuting weights depending on the data layout used
* Added getter methods to ParsedConstTfOperation to get the tensor
info and the storage memory area, needed for swizzling the weights
* Added unit tests for both NHWC and NCHW data layouts
Change-Id: I6543900c594417df630b2663d8551158b93b7836
|
|
* Also added unit test armnnTfParser/test/Pad.cpp
* Added missing 'Pad' entry to GetLayerTypeAsCString(LayerType)
* Fixed the RefLayerSupport.IsPadSupported, now it returns true
* Small fix in Optimize. Only resolve stringstream to string once
Change-Id: Ieaa1886858a48cd761ac5f30454f73e44bdd4b8f
|
|
* Since DataLayoutIndexed is now required in the TF parser, this
changes move it to the armnnUtils library so that it'll be
accessible by the armnnTfParser
* Modified CMake files and Android.mk files accordingly
Change-Id: Ie2620359ef288aeff64cb9e9bec068a466eee0e9
|
|
Change-Id: I017ca6c23b62a8978982de0ca4ad204cb8cf7c67
|
|
Change-Id: I5455b720565248ff94278e76887d63f8434a7b58
|
|
* Enable Maximum workload
Change-Id: I5f7786ff66f135f0c74935dbf5861e5a4239bcc8
|
|
* Changed NeonL2Normalisation to use NEL2NormalizeLayer to
normalise along the channel axis in either NCHW or NHWC format
Change-Id: Ibaf119b6a3de3c0f80f94b1c5fe9a356cf1fbd0e
|
|
Change-Id: Id7302c6b1df995ebe6eb8eb94bab38bee1b31b0b
|
|
Change-Id: I03fa374fd9692d98257de709f8c3ad0a49c88b95
|