diff options
author | Anthony Barbier <anthony.barbier@arm.com> | 2018-07-23 16:42:59 +0100 |
---|---|---|
committer | Anthony Barbier <anthony.barbier@arm.com> | 2018-11-02 16:54:54 +0000 |
commit | 3d677ccee046cd384abf2142f323f8e9e7a4834f (patch) | |
tree | 2e0d86a1b2438cb94386c55d1bc89b3e1061214c /arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h | |
parent | 597a85666a84c9a9414264966651551564b79299 (diff) | |
download | ComputeLibrary-3d677ccee046cd384abf2142f323f8e9e7a4834f.tar.gz |
COMPMID-1406: Refactor gemm_interleaved to use our own types and scheduler
- Ported PrepareB kernel from gemm_interleave
- Ported TransformA feature from gemm_interleave
- Allocate reshaped a and b buffers
- Added memory_manager / memory_group
- MatrixMultiply kernel
- Interleave kernels execution.
- Fixed a few bugs: all nightly Convolution tests passing for threads=1
and threads=4
- Added Doxygen documentations and comments in the code
- Added support for all data types supported
Change-Id: Iffa1c09fda0bb9c61213bb83524d5a48e7ecb03c
Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/141281
Tested-by: Jenkins <bsgcomp@arm.com>
Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
Diffstat (limited to 'arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h')
-rw-r--r-- | arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h | 5 |
1 files changed, 3 insertions, 2 deletions
diff --git a/arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h b/arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h index 382ef1caba..2fc2cf4a99 100644 --- a/arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h +++ b/arm_compute/runtime/NEON/functions/NEGEMMAssemblyDispatch.h @@ -77,8 +77,9 @@ private: bool create_function(arm_gemm::GemmMethod method, const ITensor *a, const ITensor *b, ITensor *d, float alpha, float beta, bool pretranspose_hint); /** Interface for the arm_gemm fallback */ - std::unique_ptr<IFallback> _arm_gemm; - MemoryGroup _memory_group; /**< Function memory group */ + std::unique_ptr<IFallback> _arm_gemm; + MemoryGroup _memory_group; /**< Function memory group */ + std::shared_ptr<IMemoryManager> _memory_manager; /**< Copy of the memory manager used to create the memory group to be used when instantiating new functions */ public: /** If supported create an ACL function else fallback to the arm_gemm function. * |