aboutsummaryrefslogtreecommitdiff
path: root/src/core/CL
AgeCommit message (Expand)Author
2022-12-21Update direct conv2d kernel in dynamic fusionGian Marco Iodice
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-29Adding GpuAdd to dynamic fusion operatorsRamy Elgammal
2022-11-25Implement address precalculation for indirect conv2d - OpenCLGian Marco Iodice
2022-11-22Remove dynamic fusion prototype with tests and examplesSiCong Li
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-11-14Optimize T_QUANTIZE8_ASYMMETRIC for Maliā„¢ G52Pablo Marquez Tello
2022-11-03Fix activation block in gemm.clGian Marco Iodice
2022-11-02Partially Revert "Add threshold for floating-point SOFT_RELU activation"Gunes Bayir
2022-11-01Add threshold for floating-point SOFT_RELU activationMilos Puzovic
2022-11-01Rewrite dynamic fusionSiCong Li
2022-11-01Rework direct convolution heuristic on OpenCLGian Marco Iodice
2022-10-07Workaround CL compiler issue on FP16Viet-Hoa Do
2022-10-06Rework DepthwiseConvolution heuristic on OpenCLGian Marco Iodice
2022-10-06Improve start-up time in gemmlowp reshaped rhs only.Adnan AlSinan
2022-10-03Force CL kernel compilation with 64 registersViet-Hoa Do
2022-09-14Adding GELU activationMurray Kornelsen
2022-09-09Add a macro guard in all OpenCL kernels in gemmlowp.clGian Marco Iodice
2022-09-07Optimize depthwise convolution on OpenCLGian Marco Iodice
2022-09-02F16 Specialization for MeanStdDevNormMurray Kornelsen
2022-09-02Enable Winograd-based conv2d when IFM>=8 on GpuGian Marco Iodice
2022-07-22Add GemmLowp MMUL Reshaped Only Rhs Support for QASYMM8/QASYMM8_SIGNEDFreddie Liardet
2022-07-21Fix direct convolution cases that were failing on OdroidAdnan AlSinan
2022-07-13Add Gemm MMUL Reshaped Only Rhs Support for FP32/FP16Gunes Bayir
2022-07-05Add G57 to GPUTargetSiCong Li
2022-06-27Implement new Elementwise Dynamic Fusion Operators: Div, FloorMichalis Spyrou
2022-06-23Enable loading OpenCL symbols in Android App native code, NDK environment.ohadagoogle
2022-06-15Add support OpenCL 3.0 non-uniform workgroupViet-Hoa Do
2022-06-15Fix performance regression in Winograd Output Transform (OpenCL)Gian Marco Iodice
2022-05-31Add cl_khr_integer_dot_product extension supportViet-Hoa Do
2022-05-26Disable unsafe FP optimizations causing accuracy issuesGunes Bayir
2022-05-09Mismatches in dynamically fused direct conv2d + add kernelMichalis Spyrou
2022-05-06Integrate Dynamic Fusion patchesSiCong Li
2022-04-19Add CLPool3d Int8 SupportMohammed Suhail Munshi
2022-04-14Include missing embedded headersSiCong Li
2022-03-15Implementation of ClPooling3dramelg01
2022-03-08Merge kernel prototype patchGiorgio Arena
2022-02-11Improve start-up time for concatenation layersramelg01
2022-02-10Improve start-up time for winograd_output_transform_*_nhwcramelg01
2022-02-09Remove deprecated remap functions.Adnan AlSinan
2022-02-09Improve start-up time for winograd_input_transform_*_nhwcramelg01
2022-02-08Improve start-up time for winograd_filter_transform_*_nhwcramelg01
2022-02-02Revert "Rework gemm_mm_reshaped_only_rhs_ kernels with new macros"Ramy Elgammal
2022-01-25Rework gemm_mm_reshaped_only_rhs_ kernels with new macrosGian Marco Iodice
2021-12-24Replacing non-inclusive terms with proper termsramelg01
2021-12-23Rework gemm_reshape_lhs_ with new macrosAdnan AlSinan
2021-12-13Remove padding from ClDirectConv2dKernelAdnan AlSinan
2021-12-10Use #if directive instead of regular condition in CLDirectConv2DGiorgio Arena
2021-12-01Improve start-up direct convolution on OpenCLGian Marco Iodice