aboutsummaryrefslogtreecommitdiff
path: root/src/runtime
AgeCommit message (Expand)Author
2023-06-16Add Fused Activation to OpenCL MatMulMohammed Suhail Munshi
2023-06-15Break up arm_compute/core/Types.h a bitMatthew Bentham
2023-06-09Reorder destructor in srcDavid Svantesson
2023-05-12Fix performance regression in FP16 DeconvolutionJakub Sujak
2023-05-11Remove check for bias in CPU Depthwise ConvolutionJakub Sujak
2023-05-10Remove inclusion of NEReorderKernel header from NEReorderLayerRamy Elgammal
2023-05-10Re-enable dyanmic weights in Neon™ depthwise convolutionRamy Elgammal
2023-05-05Disable dynamic weights in unsupported operatorsViet-Hoa Do
2023-05-05Make NECast::validate take args by const pointerMatthew Bentham
2023-05-04Implement OpenCL MatMul heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-05-03Guards to make NEReorder aarch64 onlyDavid Svantesson
2023-05-02Add fp16 GeMM heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-05-01Add Reorder to changelogDavid Svantesson
2023-04-28Reorder addedDavid Svantesson
2023-04-28Fix OMPScheduler run_workloads single thread issueSiCong Li
2023-04-26Change fp16 GeMM heuristic for Arm® Mali™-G77Gian Marco Iodice
2023-04-26Integrate multi-threaded pretranspose_B_arraySiCong Li
2023-04-14Fix dynamic weights for CPU connected layerViet-Hoa Do
2023-04-13Implement MatMul Function and Operator with Floating Point support for CPUMohammed Suhail Munshi
2023-04-04Support dynamic weights for Fully Connected layers on GPUJakub Sujak
2023-04-03Implement MatMul FunctionRamy Elgammal
2023-04-03Add Cropping to CLBatchToSpaceOmar Al Khatib
2023-03-30Add cropping support to NEBatchToSpaceSiCong Li
2023-03-21Add dynamic weights for CPU fully connected layerViet-Hoa Do
2023-03-14Add CropInfo to BatchToSpace reference and fixtureSiCong Li
2023-03-06Fix LWS search space used by CLTunerSiCong Li
2023-03-03Add weights_info as optional input for NEDeconvolutionLayerAnnop Wongwathanarat
2023-02-14Extend skip upsampling for deconvolution for non-1x1 kernelsAnnop Wongwathanarat
2023-02-09Fix performance regression in Transposed ConvolutionGunes Bayir
2023-02-01Add new operator AddMulAdd for Neon™ backend for Float/Quantized typesGunes Bayir
2023-02-01Remove fixed format strides hackJonathan Deakin
2023-01-30Skip upsampling for deconvolution when not neededAnnop Wongwathanarat
2023-01-23Fix performance regression when stride equal to kernel sizeGunes Bayir
2023-01-20Add enable_fast_math for NEDeconvolutionLayerAnnop Wongwathanarat
2023-01-18Revert "Update the heuristic for CLDepthwiseConvolutionNative kernel"Gian Marco Iodice
2023-01-12Update the heuristic for CLDepthwiseConvolutionNative kernelGian Marco Iodice
2022-12-30Move DWC native heuristic into the heuristic folderGian Marco Iodice
2022-12-29Update the ClConv2d heuristicGian Marco Iodice
2022-12-21Optimize MeanReduce by integer acc. and removing upfront dequant.Omar Al Khatib
2022-12-14Optimize Transposed Convolution for CL backend (Quantized)Gunes Bayir
2022-12-09Use heuristics for setting dynamic fusion direct conv2d tile sizesRamy Elgammal
2022-12-09Implement the OpenCL kernel to compute the indirect convolutionGian Marco Iodice
2022-11-22Remove dynamic fusion prototype with tests and examplesSiCong Li
2022-11-18Add num_threads_to_use to OMPScheduler based on workload sizecfRod
2022-11-14Optimize Transposed Convolution for CL backend (FP32/16)Gunes Bayir
2022-10-06Rework DepthwiseConvolution heuristic on OpenCLGian Marco Iodice
2022-09-16Optimize Quantized/Integer Bilinear Scale for Neon™Gunes Bayir
2022-09-09Optimize FP32/16 Bilinear Scale Kernel for Neon™Gunes Bayir
2022-09-07Optimize depthwise convolution on OpenCLGian Marco Iodice
2022-09-01Use parent buffer in CLSubTensor. This avoids calling enqueueMapBuffer repeat...Murray Kornelsen