author | Eric Kunze <eric.kunze@arm.com> | 2024-02-23 13:53:41 -0800 |
---|---|---|
committer | Eric Kunze <eric.kunze@arm.com> | 2024-03-18 17:28:04 -0700 |
commit | 770b18e3d2f36f520b824c70a063381156511bb7 (patch) | |
tree | d223bb2960ab184814d7b93cbeb30d0496e7814b /chapters | |
parent | fb65bcf27eb77f593abd345aed0e884a440c827f (diff) | |
Add a new rationale appendix
This appendix attempts to record the rationale for decisions made
when modifying the TOSA specification.
Signed-off-by: Eric Kunze <eric.kunze@arm.com>
Change-Id: Ia2332d39c075a53f611665c446566bd89385b9fb
Diffstat (limited to 'chapters')
-rw-r--r-- | chapters/appendix_c.adoc | 29 |
1 files changed, 29 insertions, 0 deletions
diff --git a/chapters/appendix_c.adoc b/chapters/appendix_c.adoc
new file mode 100644
index 0000000..9c67849
--- /dev/null
+++ b/chapters/appendix_c.adoc
@@ -0,0 +1,29 @@
+//
+// This confidential and proprietary software may be used only as
+// authorised by a licensing agreement from ARM Limited
+// (C) COPYRIGHT 2024 ARM Limited
+// ALL RIGHTS RESERVED
+// The entire notice above must be reproduced on all authorised
+// copies and copies may only be made to the extent permitted
+// by a licensing agreement from ARM Limited.
+
+== Appendix C - Rationale
+
+This appendix documents the rationale behind decisions made while creating the TOSA specification.
+Explanations and definitions contained in this appendix are non-normative.
+
+=== FP8
+
+The operators that perform calculations on FP8 data types are limited.
+Fewer mantissa bits in FP8 make it inappropriate for use in most elementwise operations such as <<ADD>>.
+Support was also added to the data layout and movement operations on the understanding that no calculations are performed.
+Two extensions for the FP8 types were created in order to cover both formats defined by <<OCP-OFP8,OCP-OFP8>>.
+
+=== Transcendental Functions
+
+In the TOSA specification, a limited number of transcendental operations are supported.
+The operators supported are sufficient for common networks while minimizing the number of operations an implementation must support.
+Originally, SIGMOID and TANH were added as the common functions used for activations.
+ERF was added to support GELU style activation functions.
+SIN and COS were added to provide a base level of trigonometric functionality as well as support for Rotary Position Embedding.
+
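For readers of the Transcendental Functions rationale in the diff above, the following is a minimal Python sketch of how ERF relates to GELU and how SIN and COS relate to Rotary Position Embedding. It is not part of the commit or of the TOSA specification. The function names `gelu` and `rope_rotate` and the parameter `theta` are hypothetical, chosen only for illustration. The GELU form shown is the standard erf-based definition, and the rotation is the usual 2-D rotation applied to a single feature pair.

```python
import math
from typing import Tuple


def gelu(x: float) -> float:
    """Erf-based GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def rope_rotate(pair: Tuple[float, float], position: int, theta: float) -> Tuple[float, float]:
    """Rotate one (even, odd) feature pair by the angle position * theta.

    `theta` is an illustrative per-pair frequency, not a TOSA parameter.
    """
    angle = position * theta
    c, s = math.cos(angle), math.sin(angle)
    x0, x1 = pair
    return (x0 * c - x1 * s, x0 * s + x1 * c)
```

For example, `gelu(1.0)` is roughly 0.8413, and rotating the pair `(1.0, 0.0)` at position 1 with `theta = math.pi / 2` gives approximately `(0.0, 1.0)`. Only an error function is needed for the first computation and only a sine and a cosine for the second, which is consistent with the small operator set the rationale describes.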