diff options
author | Gian Marco Iodice <gianmarco.iodice@arm.com> | 2018-04-17 09:52:44 +0100 |
---|---|---|
committer | Anthony Barbier <anthony.barbier@arm.com> | 2018-11-02 16:49:37 +0000 |
commit | fd683111bba15288dc88b7f53486f935ebeccde0 (patch) | |
tree | b4e249618ff95c5b3f620fc4ee9500bead24e118 /src/core/CL/CLKernelLibrary.cpp | |
parent | dfca60b8e8805966624c7c941f289e090e3d73bb (diff) | |
download | ComputeLibrary-fd683111bba15288dc88b7f53486f935ebeccde0.tar.gz |
COMPMID-922 - CLGEMM FP16 optimizations - part1
This patch improves of ~20% GEMM fp16.
The results has been reported at the following confluence page:
https://confluence.arm.com/display/MLENG/GEMM+FP32+performance%3A+ACL+18.05
I am aware with few cases we have a bit of degradation. However this cases are
memory bound anyway (Fully connected layer cases)
Change-Id: I183cbb7fba55a0b5eb86532c4dca5efe096096b0
Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/128044
Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
Tested-by: Jenkins <bsgcomp@arm.com>
Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
Diffstat (limited to 'src/core/CL/CLKernelLibrary.cpp')
-rw-r--r-- | src/core/CL/CLKernelLibrary.cpp | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/src/core/CL/CLKernelLibrary.cpp b/src/core/CL/CLKernelLibrary.cpp index 0509f1e785..ad028ec5e8 100644 --- a/src/core/CL/CLKernelLibrary.cpp +++ b/src/core/CL/CLKernelLibrary.cpp @@ -233,6 +233,7 @@ const std::map<std::string, std::string> CLKernelLibrary::_kernel_program_map = { "gemm_mm_interleaved_transposed_qs8", "gemm.cl" }, { "gemm_mm_interleaved_transposed_qs16", "gemm.cl" }, { "gemm_mm_floating_point", "gemm.cl" }, + { "gemm_mm_floating_point_f16_bifrost", "gemm.cl" }, { "gemm_mm_floating_point_f32_bifrost", "gemm.cl" }, { "gemm_mm_floating_point_f32_bifrost_1000", "gemm.cl" }, { "gemm_mm_qs8", "gemm.cl" }, |