path: root/src/core/CL/kernels/CLGEMMMatrixMultiplyKernel.cpp
Age  Commit message  Author
2020-07-09  COMPMID-3324: Adjusting capitalization of Arm copyright claim to reflect Arm ...  Michele Di Giorgio
2020-04-22  COMPMID-3280: Make all ML primitives for CL use the new interface - Part1 - Fix2  Manuel Bottini
2020-04-17  COMPMID-3280: Make all ML primitives for CL use the new interface - Part 1  Manuel Bottini
2020-03-10  COMPMID-3069: Improve build time by splitting up ToolchainSupport.h  Matthew Bentham
2019-08-02  COMPMID-2539: Add bias addition check in CLGEMM validation  Gian Marco Iodice
2019-07-23  COMPMID-1979: Fuse Activation Function in CLGEMM - part 3  Gian Marco Iodice
2019-06-11  COMPMID-2380: Create utility functions for is_one and is_zero with float  Gian Marco Iodice
2019-02-07  COMPMID-1706: Fuse the bias addition within CLGEMM  Michele Di Giorgio
2019-01-25  COMPMID-1866: Revisit padding and window on CLDepthwiseConvolutionNHWC  Michele Di Giorgio
2019-01-14  Issue COMPMID-1835: Remove CLGEMMInterleave4x4Kernel and replace with CLGEMMR...  giuros01
2018-12-21  COMPMID-1836: Remove CLGEMMTranspose1xWKernel and replace with CLGEMMReshapeR...  giuros01
2018-11-20  COMPMID-1801 : (Nightly) CLWinogradConvolutionLayer FP16 mismatches  Vidhya Sudhan Loganathan
2018-11-16  COMPMID-1266 : Add support for FP16 in CLWinogradConvolutionLayer: 5x5 kernels  Vidhya Sudhan Loganathan
2018-11-08  COMPMID-1736: Fixed out-of-bound write in CLIm2Col  Gian Marco Iodice
2018-11-02  COMPMID-1451: Reverting changes for CLGEMM and CLGEMMLowp previuosly done (38...  Isabella Gottardi
2018-11-02  COMPMID-1607 - (Nightly) CLGEMMLowpMatrixMultiplyCore errors and mismatches  Isabella Gottardi
2018-11-02  COMPMID-1607 - (Nightly) CLGEMMLowpMatrixMultiplyCore errors and mismatches  Isabella Gottardi
2018-11-02  COMPMID-708 Fix AccessWindowTranspose  Giorgio Arena
2018-11-02  COMPMID-1478: Stop relying on static default OpenCL objects in cl2.hpp  Anthony Barbier
2018-11-02  COMPMID-1188 - Fixed performance degradation with GEMM3D  Gian Marco Iodice
2018-11-02  COMPMID-1276 - Allow GEMM to work with 3D input tensor  Gian Marco Iodice
2018-11-02  COMPMID-1384: graph_mobilenet fails for NHWC on OpenCL  Georgios Pinitas
2018-11-02  COMPMID-970 : Remove QS8 / QS16 support  Vidhya Sudhan Loganathan
2018-11-02  COMPMID-911: Allow GEMM to work with 3D tensors  Isabella Gottardi
2018-11-02  COMPMID-817: Tuner: Port kernels to new design.  Georgios Pinitas
2018-11-02  COMPMID-655 : Check FP16 is supported by the GPU  Vidhya Sudhan Loganathan
2018-11-02  COMPMID-1052 - Rework validate method in CLGEMM  Gian Marco Iodice
2018-11-02  COMPMID-922 - CLGEMM FP16 optimizations - part2  Gian Marco Iodice
2018-11-02  COMPMID-1117: TransposeAccessWindow leads to high padding  Georgios Pinitas
2018-11-02  COMPMID-1026 - Add support for 4x4 output tile in CLWinogradConvolutionLayer  Gian Marco Iodice
2018-11-02  COMPMID-922 - CLGEMM FP16 optimizations - part1  Gian Marco Iodice
2018-11-02  COMPMID-1031: Use LWS hints for G51, G51BIG, G51LIT, and TNOX  Sam Laynton
2018-11-02  COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=true  Gian Marco Iodice
2018-11-02  COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)  Gian Marco Iodice
2018-11-02  COMPMID-886 Don't use LWS hints by default for GPU post Mali-G72  Michalis Spyrou
2018-11-02  COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1)  Gian Marco
2018-11-02  COMPMID-979: Add NHWC data layout to the tensor's metadata (Part 2)  Isabella Gottardi
2018-11-02  COMPMID-754: Add validation to kernels.  Georgios Pinitas
2018-11-02  COMPMID-748 - Integrating optimized SGEMM for bifrost  Gian Marco
2018-11-02  COMPMID-759 - CLGEMM optimization for McVail benchmarks  Gian Marco
2018-11-02  COMPMID-617: Adds CLFullyConnectionLayer validation support  Georgios Pinitas
2018-11-02  COMPMID-661: Vectorize im2col and add lws heuristics for convolution kernels #46  Anthony Barbier
2018-11-02  COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)  Anton Lokhmotov
2018-11-02  COMPMID-524 - Implemented CLTuner object  Gian Marco
2018-11-02  COMPMID-477 - Optimized batched case in CLConvolutionLayer  Gian Marco Iodice
2018-11-02  COMPMID-417: Cleanup CL FullyConnectedLayer  Moritz Pflanzer
2018-09-17  COMPMID-417: DepthConvert NEON for QS8/QS16.  Georgios Pinitas
2018-09-17  COMPMID-434 - Port CLGEMM to support 16 bit fixed point  Gian Marco Iodice
2018-09-17  COMPMID-411 - Port CLGEMM to support 8 bit fixed point  Gian Marco Iodice
2018-09-17  COMPMID-344 Updated doxygen  Anthony Barbier