From 57e79c0cf2b37d37c4895ee08fd7c2534ae44cad Mon Sep 17 00:00:00 2001 From: Eric Kunze Date: Tue, 3 Nov 2020 11:23:09 -0800 Subject: Add int48 to supported number formats Currently only enabled for the accumulator of a convolution and the input to a RESCALE. Signed-off-by: Eric Kunze Change-Id: Ic5474124b4a8735f12ef9c20c2be48e55767a3fc --- chapters/introduction.adoc | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/chapters/introduction.adoc b/chapters/introduction.adoc index 5134330..53a6511 100644 --- a/chapters/introduction.adoc +++ b/chapters/introduction.adoc @@ -121,13 +121,18 @@ The following number formats are defined in TOSA. See section 2.3 for details on |int16 | -32768 | +32768 -|Signed 16-bit twos-complement values. Symmetric per-tensor quantization. No zero point is used , scale factor is provided in the operation. +|Signed 16-bit twos-complement values. Symmetric per-tensor quantization. No zero point is used, scale factor is provided in the operation. |int32 | (1<<31)-1 | -(1<<31) |32-bit twos-complement value. No scale factor used. +|int48 +| (1<<47)-1 +| -(1<<47) +|48-bit twos-complement value. No scale factor used. + |float | -infinity | +infinity -- cgit v1.2.1