summaryrefslogtreecommitdiff
path: root/docs/sections/testing_benchmarking.md
diff options
context:
space:
mode:
Diffstat (limited to 'docs/sections/testing_benchmarking.md')
-rw-r--r--docs/sections/testing_benchmarking.md45
1 files changed, 33 insertions, 12 deletions
diff --git a/docs/sections/testing_benchmarking.md b/docs/sections/testing_benchmarking.md
index 6350f52..d040d17 100644
--- a/docs/sections/testing_benchmarking.md
+++ b/docs/sections/testing_benchmarking.md
@@ -47,8 +47,6 @@ arm_ml_embedded_evaluation_kit-<use_case>-tests
```log
INFO - native platform initialised
-INFO - ARM Ethos-U55 Evaluation application for MPS3 FPGA Prototyping Board and FastModel
-
...
===============================================================================
All tests passed (37 assertions in 7 test cases)
@@ -60,11 +58,24 @@ INFO - ARM Ethos-U55 Evaluation application for MPS3 FPGA Prototyping Board and
Profiling is enabled by default when configuring the project. Profiling enables you to display:
-- The active and idle NPU cycle counts when the Arm® *Ethos™-U55* is enabled. For more information, refer to the
- `-DETHOS_U55_ENABLED` section in: [Build options](./building.md#build-options).
+- NPU event counters when the Arm® *Ethos™-U* NPU is enabled (see `ETHOS_U_NPU_ENABLED` in [Build options](./building.md#build-options) )
+
+ - Total cycle: The number of NPU cycles
+
+ - Active cycles: number of NPU cycles that were used for computation
+
+ - Idle cycles: number of cycles for which the NPU was idle
+
+ - AXI0 read beats: The number of AXI beats with read transactions from AXI0 bus. AXI0 is the bus where
+ *Ethos-U* NPU reads and writes to the computation buffers, activation buf, or tensor arenas.
+
+ - AXI0 write beats: The number of AXI beats with write transactions to AXI0 bus.
+
+ - AXI1 read beats: The number of AXI beats with read transactions from AXI1 bus. AXI1 is the bus where
+ *Ethos-U* NPU reads the model. So, read-only.
+
- If CPU profiling is enabled, the CPU cycle counts and the time elapsed, in milliseconds, for inferences performed. For
- further information, please refer to the `-DCPU_PROFILE_ENABLED` section in:
- [Build options](./building.md#build-options).
+ further information, please refer to the `CPU_PROFILE_ENABLED` in [Build options](./building.md#build-options).
> **Note:** Only do this when running on a physical FPGA board as the FVP does not contain a cycle-approximate or
> cycle-accurate *Cortex-M* model.
@@ -74,18 +85,28 @@ For example:
- On the FVP:
```log
- Active NPU cycles: 5475412
- Idle NPU cycles: 702
+INFO - Profile for Inference:
+INFO - NPU AXI0_RD_DATA_BEAT_RECEIVED beats: 628122
+INFO - NPU AXI0_WR_DATA_BEAT_WRITTEN beats: 135087
+INFO - NPU AXI1_RD_DATA_BEAT_RECEIVED beats: 62870
+INFO - NPU ACTIVE cycles: 1081007
+INFO - NPU IDLE cycles: 626
+INFO - NPU TOTAL cycles: 1081634
```
- For the MPS3 platform, the time duration in milliseconds is also reported when `-DCPU_PROFILE_ENABLED=1` is added to
CMake configuration command, like so:
```log
- Active NPU cycles: 5629033
- Idle NPU cycles: 1005276
- Active CPU cycles: 993553 (approx)
- Time in ms: 210
+INFO - Profile for Inference:
+INFO - NPU AXI0_RD_DATA_BEAT_RECEIVED beats: 628122
+INFO - NPU AXI0_WR_DATA_BEAT_WRITTEN beats: 135087
+INFO - NPU AXI1_RD_DATA_BEAT_RECEIVED beats: 62870
+INFO - NPU ACTIVE cycles: 1081007
+INFO - NPU IDLE cycles: 626
+INFO - NPU TOTAL cycles: 1081634
+INFO - CPU ACTIVE cycles (approx): 993553
+INFO - Time ms: 210
```
The next section of the documentation refers to: [Memory Considerations](memory_considerations.md).