author | Ramy Elgammal <ramy.elgammal@arm.com> | 2023-05-19 14:23:37 +0100 |
---|---|---|
committer | Ramy Elgammal <ramy.elgammal@arm.com> | 2023-06-23 20:06:45 +0000 |
commit | c952596e70f2fe0073029f053e329a4e930ced8c (patch) | |
tree | 1cf9b1c87c2288d6af436b570802d9cc6e8b30b5 /src/gpu/cl/ClKernelLibrary.cpp | |
parent | 47a50ef12f513cfa8fde6673b8a61ed0f2d0fbaa (diff) | |
download | ComputeLibrary-c952596e70f2fe0073029f053e329a4e930ced8c.tar.gz |
Implement FP32/FP16 MatMul NT/T kernel using the MMUL extension
Resolves COMPMID-6195
Signed-off-by: ramy.elgammal@arm.com <ramy.elgammal@arm.com>
Change-Id: I8e85fe73308ed84ebb142d6d6d1562b62dddfaa5
Reviewed-on: https://review.mlplatform.org/c/ml/ComputeLibrary/+/9819
Reviewed-by: SiCong Li <sicong.li@arm.com>
Benchmark: Arm Jenkins <bsgcomp@arm.com>
Tested-by: Arm Jenkins <bsgcomp@arm.com>
Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/gpu/cl/ClKernelLibrary.cpp')
-rw-r--r-- | src/gpu/cl/ClKernelLibrary.cpp | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/src/gpu/cl/ClKernelLibrary.cpp b/src/gpu/cl/ClKernelLibrary.cpp
index 408f1f7a21..5355cb7402 100644
--- a/src/gpu/cl/ClKernelLibrary.cpp
+++ b/src/gpu/cl/ClKernelLibrary.cpp
@@ -320,6 +320,7 @@ const std::map<std::string, std::string> ClKernelLibrary::_kernel_program_map =
     { "l2_normalize_y", "common/l2_normalize.cl" },
     { "l2_normalize_z", "common/l2_normalize.cl" },
     { "mat_mul_native_mmul_nt_nt", "common/mat_mul_mmul.cl" },
+    { "mat_mul_native_mmul_nt_t", "common/mat_mul_mmul.cl" },
     { "mat_mul_native_nt_nt", "common/mat_mul.cl" },
     { "mat_mul_native_nt_t", "common/mat_mul.cl" },
     { "mat_mul_native_t_nt", "common/mat_mul.cl" },
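
For readers unfamiliar with this table, the change registers the new kernel name against the OpenCL source file that contains it. The following is a minimal sketch of how such a name-to-program lookup might behave. It is not the library's implementation: `kernel_program_map` and `program_name_for` are hypothetical names introduced for illustration, and only the entries visible in the diff are reproduced.

```cpp
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for the kernel-name -> program-file table.
// Only entries that appear in the diff are listed here.
static const std::map<std::string, std::string> kernel_program_map = {
    { "mat_mul_native_mmul_nt_nt", "common/mat_mul_mmul.cl" },
    { "mat_mul_native_mmul_nt_t", "common/mat_mul_mmul.cl" }, // entry added by this commit
    { "mat_mul_native_nt_nt", "common/mat_mul.cl" },
    { "mat_mul_native_nt_t", "common/mat_mul.cl" },
    { "mat_mul_native_t_nt", "common/mat_mul.cl" },
};

// Hypothetical helper: return the program file registered for a kernel name,
// or throw if the name has no entry in the table.
std::string program_name_for(const std::string &kernel_name)
{
    const auto it = kernel_program_map.find(kernel_name);
    if(it == kernel_program_map.end())
    {
        throw std::runtime_error("Kernel not registered: " + kernel_name);
    }
    return it->second;
}
```

In this sketch, a kernel name missing from the table cannot be resolved to a source file at all, which illustrates why a new kernel needs its own one-line entry even though the `.cl` file is already listed for another kernel.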