aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2023-08-17Fix various static check issuesViet-Hoa Do
2023-08-15Check CL command buffer extensionViet-Hoa Do
2023-08-15Fix out-of-scope CLBufferMemoryRegion's buffer still in queue issueSiCong Li
2023-08-14Optimize CLReduce for Min/Max Axis=0Gunes Bayir
2023-08-08Add support for S64 output in NEArgMinMaxLayerPablo Marquez Tello
2023-08-08Setup pre-commit and include code formatting scriptsGunes Bayir
2023-08-08Avoid using CLMatMul in CLFullyConnected when GPUTarget is Midgardramy.elgammal@arm.com
2023-08-07Port DirectConv2d to CKW backendJakub Sujak
2023-08-07Document the Conv2D heuristicGian Marco Iodice
2023-08-03Fix ReduceMean validate issueViet-Hoa Do
2023-08-03Fix CL Tile operatorViet-Hoa Do
2023-07-28Port ElementwiseBinary to CKW part 2SiCong Li
2023-07-28Retain back-compatibility for arm_compute/core/Types.hSiCong Li
2023-07-25Add GpuKernelArgumentBinding for runtime argument settingSiCong Li
2023-07-25Fix problem with exception handling in CPPSchedulerMatthew Bentham
2023-07-21Enable S64 output in CLArgMinMaxPablo Marquez Tello
2023-07-20Fix failing CTS tests by disabling matmul when weights conversion is required.Mohammed Suhail Munshi
2023-07-19Add support for input S64/U64 in CpuCastKernelPablo Marquez Tello
2023-07-18Break up core/Utils.h to reduce unused code being included everywhereMatthew Bentham
2023-07-14Port ClTemplateCast into CkwAdnan AlSinan
2023-07-14Port ClTemplateActivation into CkwAdnan AlSinan
2023-07-13Added S64/U64 support for the input in CLCastPablo Marquez Tello
2023-07-13Fix excessive calls to clReleaseCommandQueueSiCong Li
2023-07-13Enable premultiplication for depthwise convolutionMichael Tyler
2023-07-12Add compute kernel writer arguments exportViet-Hoa Do
2023-07-11Add Bias to MatMul Kernels and add support for use in Fully Connected LayerMohammed Suhail Munshi
2023-07-10Port operations to CKW prototypeNikolaj Jensen
2023-07-10Disable kernel size 3 in argminmax for axis 0Pablo Marquez Tello
2023-07-10Do not include headers necessary for logging when logging is disabledMatthew Bentham
2023-07-07Enable transpose convolution with non-square kernelsViet-Hoa Do
2023-07-07Fix unsupported configuration in CLFullyConnected validationGunes Bayir
2023-07-06Fix nightly failures in MatMulLowpNativeKernel when using bounded activation ...Mohammed Suhail Munshi
2023-07-06Move CKW prototype to separate directoryViet-Hoa Do
2023-07-05Rewrote CLArgMinMax for axis 0Pablo Marquez Tello
2023-07-05Fix unused function warningMichael Tyler
2023-07-04Depthwise channel pre-multiplicationMichael Tyler
2023-07-04Add Kernel Writer driver code to dynamic fusionSiCong Li
2023-06-29Implement FP32/16 MatMul Lhs T Rhs T/NT kernel using MMUL extensionGunes Bayir
2023-06-26Add helpers to set CKW tensor components as OpenCL kernel argumentsJakub Sujak
2023-06-26Remove dependency on fp16 definitions from some core include filesMatthew Bentham
2023-06-26Use MatMul in fully connected layer with dynamic weights when supportedMohammed Suhail Munshi
2023-06-23Implement FP32/FP16 MatMul NT/T kernel using the MMUL extensionRamy Elgammal
2023-06-23Address the issues with the ACL coverage pipeline failures related to matmul.Renato Arantes
2023-06-23Fix doxygen warningsramy.elgammal@arm.com
2023-06-22Bazel and CMake optional fp16 supportDavid Svantesson
2023-06-21Fix CPU depthwise convolution in case of large paddingViet-Hoa Do
2023-06-21Enable vmfa in arm7va/aarch32 when presentPablo Marquez Tello
2023-06-19Implement FP32/FP16 MatMul NT/NT kernel using the MMUL extensionSiCong Li
2023-06-16Add Fused Activation to OpenCL MatMulMohammed Suhail Munshi
2023-06-15Break up Utils.h a bit to reduce unused code being included everywhereMatthew Bentham