ArmNN
 20.02
NeonFullyConnectedWorkload Class Reference

#include <NeonFullyConnectedWorkload.hpp>

Inheritance diagram for NeonFullyConnectedWorkload:
BaseWorkload< FullyConnectedQueueDescriptor > IWorkload

Public Member Functions

 NeonFullyConnectedWorkload (const FullyConnectedQueueDescriptor &descriptor, const WorkloadInfo &info, std::shared_ptr< arm_compute::MemoryManagerOnDemand > &memoryManager)
 
virtual void Execute () const override
 
- Public Member Functions inherited from BaseWorkload< FullyConnectedQueueDescriptor >
 BaseWorkload (const FullyConnectedQueueDescriptor &descriptor, const WorkloadInfo &info)
 
void PostAllocationConfigure () override
 
const FullyConnectedQueueDescriptorGetData () const
 
profiling::ProfilingGuid GetGuid () const final
 
- Public Member Functions inherited from IWorkload
virtual ~IWorkload ()
 
virtual void RegisterDebugCallback (const DebugCallbackFunction &)
 

Additional Inherited Members

- Protected Attributes inherited from BaseWorkload< FullyConnectedQueueDescriptor >
const FullyConnectedQueueDescriptor m_Data
 
const profiling::ProfilingGuid m_Guid
 

Detailed Description

Definition at line 26 of file NeonFullyConnectedWorkload.hpp.

Constructor & Destructor Documentation

◆ NeonFullyConnectedWorkload()

NeonFullyConnectedWorkload ( const FullyConnectedQueueDescriptor descriptor,
const WorkloadInfo info,
std::shared_ptr< arm_compute::MemoryManagerOnDemand > &  memoryManager 
)

Definition at line 48 of file NeonFullyConnectedWorkload.cpp.

References BaseWorkload< FullyConnectedQueueDescriptor >::m_Data, QueueDescriptor::m_Inputs, QueueDescriptor::m_Outputs, and QueueDescriptor::ValidateInputsOutputs().

50  : BaseWorkload<FullyConnectedQueueDescriptor>(descriptor, info)
51 {
52  m_Data.ValidateInputsOutputs("NeonFullyConnectedWorkload", 1, 1);
53 
54  arm_compute::ITensor& input = boost::polymorphic_downcast<IAclTensorHandle*>(m_Data.m_Inputs[0])->GetTensor();
55  arm_compute::ITensor& output = boost::polymorphic_downcast<IAclTensorHandle*>(m_Data.m_Outputs[0])->GetTensor();
56 
57  m_WeightsTensor = std::make_unique<arm_compute::Tensor>();
58  BuildArmComputeTensor(*m_WeightsTensor, m_Data.m_Weight->GetTensorInfo());
59 
61  {
62  m_BiasesTensor = std::make_unique<arm_compute::Tensor>();
63  BuildArmComputeTensor(*m_BiasesTensor, m_Data.m_Bias->GetTensorInfo());
64  }
65 
66  // Construct
67  arm_compute::FullyConnectedLayerInfo fc_info;
68  fc_info.transpose_weights = m_Data.m_Parameters.m_TransposeWeightMatrix;
69 
70  auto layer = std::make_unique<arm_compute::NEFullyConnectedLayer>(memoryManager);
71  layer->configure(&input, m_WeightsTensor.get(), m_BiasesTensor.get(), &output, fc_info);
72  m_FullyConnectedLayer.reset(layer.release());
73 
74  // Allocate
76  {
78  }
79  else
80  {
82  }
83 
84  if (m_BiasesTensor)
85  {
87  {
89  }
90  else
91  {
93  }
94  }
95 
96  // Force Compute Library to perform the necessary copying and reshaping, after which
97  // delete all the input tensors that will no longer be needed
98  m_FullyConnectedLayer->prepare();
99  FreeUnusedTensors();
100 }
const ConstCpuTensorHandle * m_Weight
bool m_TransposeWeightMatrix
Enable/disable transpose weight matrix.
const FullyConnectedQueueDescriptor m_Data
Definition: Workload.hpp:46
void ValidateInputsOutputs(const std::string &descName, unsigned int numExpectedIn, unsigned int numExpectedOut) const
DataType GetDataType() const
Definition: Tensor.hpp:95
bool m_BiasEnabled
Enable/disable bias.
void InitializeArmComputeTensorData(arm_compute::Tensor &tensor, const ConstCpuTensorHandle *handle)
std::vector< ITensorHandle * > m_Outputs
std::vector< ITensorHandle * > m_Inputs
const ConstCpuTensorHandle * m_Bias
const TensorInfo & GetTensorInfo() const

Member Function Documentation

◆ Execute()

void Execute ( ) const
overridevirtual

Implements IWorkload.

Definition at line 102 of file NeonFullyConnectedWorkload.cpp.

References ARMNN_SCOPED_PROFILING_EVENT_NEON.

103 {
104  ARMNN_SCOPED_PROFILING_EVENT_NEON("NeonFullyConnectedWorkload_Execute");
105  m_FullyConnectedLayer->run();
106 }
#define ARMNN_SCOPED_PROFILING_EVENT_NEON(name)

The documentation for this class was generated from the following files: