aboutsummaryrefslogtreecommitdiff
path: root/chapters/scatter_gather.adoc
blob: e1be77f9b28b38223f3ae7bf9ca3b3689a1f2a2c (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
//
// This confidential and proprietary software may be used only as
// authorised by a licensing agreement from ARM Limited
// (C) COPYRIGHT 2020 ARM Limited
// ALL RIGHTS RESERVED
// The entire notice above must be reproduced on all authorised
// copies and copies may only be made to the extent permitted
// by a licensing agreement from ARM Limited.

=== Scatter/Gather Operators

==== GATHER

Generate a tensor for which each element in the output is a subtensor of the values tensor based on the indices.
N is the number of batches, W the number of indices in each batch, K the range of each index and C the number data channels for each index.

*Arguments:*

|===
|Argument|Type|Name|Shape|Description

|Input|value_t*|values|[N,K,C]|3D value tensor
|Input|index_t*|indices|[N,W]|2D index tensor
|Output|value_t*|output|[N,W,C]|3D output tensor
|===

*Quantization Parameters:*

None

*Operation Function:*

[source,c]
----
for_each(0<=n<N, 0<=w<W, 0<=c<C) {
    index_t k = tensor_read<index_t>(indices, [N,W], [n,w])
    assert(0<=k && k<K)
    value_t value = tensor_read<value_t>(values, [N,K,C], [n, k, c])
    tensor_write<value_t>(output, [N,W,C], [n,w,c], value)
}
----

*Supported Data Types:*

|===
|Profile|Mode|index_t|value_t

|Any|signed 8|int32|aint8
|Any|signed 16|int32|int16
|Any|signed 32|int32|int32
|MI,MT|float|int32|float
|===

==== SCATTER

The values_out tensor is set to the values_in tensor with data modified as follows: data from the input tensor is inserted at the positions specified by the indices tensor.
N is the number of batches, W the number of indices in each batch, K the range of each index and C the number data channels for each index.

*Arguments:*

|===
|Argument|Type|Name|Shape|Description

|Input|value_t*|values_in|[N,K,C]|3D values in tensor
|Input|index_t*|indices|[N,W]|2D index tensor
|Input|value_t*|input|[N,W,C]|3D input tensor
|Output|value_t*|values_out|[N,K,C]|3D values out tensor
|===

*Quantization Parameters:*

None

*Operation Function:*

[source,c]
----
// Copy the values_in tensor to the values_out tensor.
// Values not written by the scatter operation are unchanged in the output.
for_each(0<=n<N, 0<=k<K, 0<=c<C) {
    value_t value = tensor_read<value_t>(values_in, [N,K,C], [n,k,c])
    tensor_write<value_t>(values_out, [N,K,C], [n, k, c], value)
}
// Now perform the SCATTER operation, writing to the positions from the indices tensor
for_each(0<=n<N, 0<=w<W, 0<=c<C) {
    index_t k = tensor_read<index_t>(indices, [N,W], [n,w])
    assert(0<=k && k<K)
    value_t value = tensor_read<value_t>(input, [N,W,C], [n,w,c])
    tensor_write<value_t>(values_out, [N,K,C], [n, k, c], value)
}
----

*Supported Data Types:*

|===
|Profile|Mode|index_t|value_t

|Any|signed 8|int32|aint8
|Any|signed 16|int32|int16
|Any|signed 32|int32|int32
|MI,MT|float|int32|float
|===