aboutsummaryrefslogtreecommitdiff
path: root/src/runtime/NEON/functions/NEGEMMConvolutionLayer.cpp
diff options
context:
space:
mode:
authorGian Marco Iodice <gianmarco.iodice@arm.com>2018-04-19 12:05:08 +0100
committerAnthony Barbier <anthony.barbier@arm.com>2018-11-02 16:51:17 +0000
commitbb36a8efc1092f66798e3b880c55ec488021bb02 (patch)
tree62e0265d84575bc10496c84f4908ed27529166ea /src/runtime/NEON/functions/NEGEMMConvolutionLayer.cpp
parent4dcb583c052e14f08809cc9ee420e690264e7bbe (diff)
downloadComputeLibrary-bb36a8efc1092f66798e3b880c55ec488021bb02.tar.gz
COMPMID-922 - CLGEMM FP16 optimizations - part2
This patch improves of ~30 % GEMM fp16 when the reshape is required The results have been reported at the following confluence page: https://confluence.arm.com/display/MLENG/GEMM+FP16+performance%3A+ACL+18.05 Change-Id: I8233095a7e9ab06f1f915782a25dd41653b49140 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/128254 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: Jenkins <bsgcomp@arm.com>
Diffstat (limited to 'src/runtime/NEON/functions/NEGEMMConvolutionLayer.cpp')
0 files changed, 0 insertions, 0 deletions