aboutsummaryrefslogtreecommitdiff
path: root/src/core/CL/cl_kernels/gemm.cl
AgeCommit message (Expand)Author
2019-09-02COMPMID-1965 Extend CLGEMMMatrixMultiplyReshapedKernel to support transposed ...Giorgio Arena
2019-07-23COMPMID-1979: Fuse Activation Function in CLGEMM - part 3Gian Marco Iodice
2019-07-17COMPMID-1979: Fuse Activation Function in CLGEMM - part 2Gian Marco Iodice
2019-06-24COMPMID-2172: Fuse bias addition with CLGEMMMatrixMultiplyNativeKernelGian Marco Iodice
2019-06-20COMPMID-2053: Fuse bias addition with CLGEMMMatrixMultiplyReshapedKernelGian Marco Iodice
2019-06-04COMPMID-2171: Fuse bias addition with CLGEMMMatrixMultiplyReshapedOnlyRHSKernelGeorgios Pinitas
2019-05-20COMPMID-2338: Remove CLGEMMInterleave4x4 and CLGEMMTranspose1xWGian Marco Iodice
2019-05-17COMPMID-2093: Implement CLGEMMNativegiuros01
2019-05-16COMPMID-2041: Create GEMM helper file for OpenCL.Usama Arif
2019-04-01COMPMID-2002: Implement CLGEMMLowpMatrixMultiplyReshapedOnlyRHS - TransposedGian Marco Iodice
2019-03-20COMPMID-2043: Add support for "dummy threads" in CLGEMMReshapedGian Marco Iodice
2019-03-12COMPMID-1964: Implement CLGEMMMatrixMultiplyReshapedOnlyRHS - Not transposedGian Marco Iodice
2019-03-08COMPMID-2000: Implement CLGEMMMatrixMultiplyReshapedOnlyRHS - TransposedGian Marco Iodice
2019-02-07COMPMID-1706: Fuse the bias addition within CLGEMMMichele Di Giorgio
2019-01-24COMPMID-1900: Nightly issue with GEMMReshapeLHSMatrixGian Marco Iodice
2019-01-21COMPMID-1899: Fix NaN issue in CLGEMMMatrixMultiplyReshapedKernelGian Marco Iodice
2019-01-15COMPMID-1687: Optimize CLGEMMMatrixMultiplyKernel (part 1)Gian Marco Iodice
2019-01-09COMPMID-1837 : Implement REPEAT utility macro on OpenCLVidhya Sudhan Loganathan
2018-12-20COMPMID-1858: Fix boundary check in gemm_reshape_rhs_matrix_t and gemm_reshap...Gian Marco Iodice
2018-12-19COMPMID-1834: Add transpose support to CLGEMMReshapeLHSMatrixKernelGian Marco Iodice
2018-12-17COMPMID-1710: Fixing gemm_mm_reshaped_lhs_nt_rhs_t with REINTERPRET_OUTPUT_AS_3DGian Marco Iodice
2018-12-14COMPMID-1687: Optimize CLGEMMMatrixMultiplyKernel for Mali-G76 - Part1Gian Marco Iodice
2018-12-11COMPMID-1775: Implement CLGEMMReshapeRHSMatrixKernel to reshape the RHS matri...Gian Marco Iodice
2018-12-10COMPMID-1774: Implement CLGEMMReshapeLHSMatrixKernel to reshape the LHS matri...Gian Marco Iodice
2018-11-20COMPMID-1801 : (Nightly) CLWinogradConvolutionLayer FP16 mismatchesVidhya Sudhan Loganathan
2018-11-16COMPMID-1266 : Add support for FP16 in CLWinogradConvolutionLayer: 5x5 kernelsVidhya Sudhan Loganathan
2018-11-02COMPMID-1413 - Improve the performance of GEMMLowp with 8 bit dot product on ...Gian Marco Iodice
2018-11-02COMPMID-1276 - Allow GEMM to work with 3D input tensorGian Marco Iodice
2018-11-02COMPMID-1384: graph_mobilenet fails for NHWC on OpenCLGeorgios Pinitas
2018-11-02COMPMID-970 : Remove QS8 / QS16 supportVidhya Sudhan Loganathan
2018-11-02COMPMID-1307: Mismatches in CLGEMMConvolutionLayer F16Georgios Pinitas
2018-11-02COMPMID-911: Allow GEMM to work with 3D tensorsIsabella Gottardi
2018-11-02COMPMID-1083 : Compute library should be made usable on non-ARM platformsVidhya Sudhan Loganathan
2018-11-02COMPMID-1083 : Compute library should be made usable on non-ARM platformsVidhya Sudhan Loganathan
2018-11-02COMPMID-1044: Optimizing GCGEMM - Support for not reshaped GEMM on GLESMichele Di Giorgio
2018-11-02COMPMID-1150 : (OCLGrind) Kernel compilation error and assertionGeorgios Pinitas
2018-11-02COMPMID-922 - CLGEMM FP16 optimizations - part2Gian Marco Iodice
2018-11-02COMPMID-1056 - Optimizing CLGEMMMatrixMultiplyKernel refactoring the inner loopGian Marco Iodice
2018-11-02COMPMID-922 - CLGEMM FP16 optimizations - part1Gian Marco Iodice
2018-11-02COMPMID-1032 - Fixing bug in CLGEMM when is_interleaved_transposed=trueGian Marco Iodice
2018-11-02COMPMID-935 - Implementing Convolution with Winograd on OpenCL (part 4)Gian Marco Iodice
2018-11-02COMPMID-935 - Implementing Convolution with Winograd on OpenCL (Part 1)Gian Marco
2018-11-02COMPMID-882 - Optimizing GEMMLowp on OpenCL reshaping matricesGian Marco
2018-11-02COMPMID-748 - Integrating optimized SGEMM for bifrostGian Marco
2018-11-02COMPMID-697 - Rework GEMMLowp interface on OpenCLGian Marco
2018-11-02COMPMID-661: Optimize FC layer with 2 new Bifrost kernels and LWS tuning (#33)Anton Lokhmotov
2018-11-02Allow running without cl_khr_fp16Matthew Bentham
2018-11-02COMPMID-640: FullyConnectedLayer failures on both NEON/CLGeorgios Pinitas
2018-11-02COMPMID-477 - Optimized batched case in CLConvolutionLayerGian Marco Iodice
2018-09-17COMPMID-415: Use half_float library for F16Moritz Pflanzer