aboutsummaryrefslogtreecommitdiff
path: root/src/gpu
diff options
context:
space:
mode:
authorJakub Sujak <jakub.sujak@arm.com>2023-04-18 08:33:56 +0100
committerJakub Sujak <jakub.sujak@arm.com>2023-04-27 10:23:50 +0000
commit5e99a3e4d65f814c5e6938c31a0ef505d0fb8f17 (patch)
treeee01c5cbd7721c4158d564ca59ad6439d6d45bee /src/gpu
parentf16eed979ecaa234b308c8eb145c5f9512673a54 (diff)
downloadComputeLibrary-5e99a3e4d65f814c5e6938c31a0ef505d0fb8f17.tar.gz
Add quantized CL MatMul kernel for LHS NT, RHS T
Implement a native kernel for batched Matrix Multiplication for the quantized data types QASYMM8 and QASYMM8_SIGNED and with the MatMul attributes `adj_x = false, adj_y = true`. Resolves: COMPMID-5923 Change-Id: I477b2dd886edfe83beaba9efc7d6b05ed19f5da4 Signed-off-by: Jakub Sujak <jakub.sujak@arm.com> Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9467 Tested-by: Arm Jenkins <bsgcomp@arm.com> Reviewed-by: SiCong Li <sicong.li@arm.com> Comments-Addressed: Arm Jenkins <bsgcomp@arm.com> Benchmark: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/gpu')
-rw-r--r--src/gpu/cl/ClKernelLibrary.cpp1
1 files changed, 1 insertions, 0 deletions
diff --git a/src/gpu/cl/ClKernelLibrary.cpp b/src/gpu/cl/ClKernelLibrary.cpp
index 4612ca35b8..a9080049b5 100644
--- a/src/gpu/cl/ClKernelLibrary.cpp
+++ b/src/gpu/cl/ClKernelLibrary.cpp
@@ -324,6 +324,7 @@ const std::map<std::string, std::string> ClKernelLibrary::_kernel_program_map =
{ "mat_mul_native_t_nt", "common/mat_mul.cl" },
{ "mat_mul_native_t_t", "common/mat_mul.cl" },
{ "mat_mul_native_quantized_nt_nt", "common/mat_mul_quantized.cl" },
+ { "mat_mul_native_quantized_nt_t", "common/mat_mul_quantized.cl" },
{ "mat_mul_native_quantized_t_nt", "common/mat_mul_quantized.cl" },
{ "mat_mul_native_quantized_t_t", "common/mat_mul_quantized.cl" },
{ "max_unpooling_layer_2", "common/unpooling_layer.cl" },