aboutsummaryrefslogtreecommitdiff
path: root/src
AgeCommit message (Expand)Author
2023-05-03Guards to make NEReorder aarch64 onlyDavid Svantesson
2023-05-03Update a64_transpose_interleave_16.hppDavid Svantesson
2023-05-03Bazel and CMake updatesDavid Svantesson
2023-05-03Fix CPU MatMul broadcast detectionViet-Hoa Do
2023-05-02Fix export_to_cl_image issue in the fp16 GeMM implementationGian Marco Iodice
2023-05-02Add fp16 GeMM heuristic for Arm® Mali™-G710Gian Marco Iodice
2023-05-02Fix fully connected and matmul mismatchesViet-Hoa Do
2023-05-01Add Reorder to changelogDavid Svantesson
2023-04-28Reorder addedDavid Svantesson
2023-04-28Fix the gather layer indices checkViet-Hoa Do
2023-04-28Fix OMPScheduler run_workloads single thread issueSiCong Li
2023-04-27Avoid printing error message for each not found OpenCl libraryRamy Elgammal
2023-04-27Add quantized CL MatMul kernel for LHS NT, RHS TJakub Sujak
2023-04-26Change fp16 GeMM heuristic for Arm® Mali™-G77Gian Marco Iodice
2023-04-26Improve Winograd performance on OpenCLGian Marco Iodice
2023-04-26Add FP16 depthwise kernels for SME2David Mansell
2023-04-26Integrate multi-threaded pretranspose_B_arraySiCong Li
2023-04-25Fix rounding to nearest even for armv7aRamy Elgammal
2023-04-20Implement CL kernel for a native batched matmul Quantized - LHS transposed, R...Omar Al Khatib
2023-04-19NETranspose 8x8 kernel for 32-bit elementsEthan Doe
2023-04-19Add quantized support for CPU MatMulViet-Hoa Do
2023-04-17Add quantized CL MatMul kernels for Lhs NT/T, Rhs NTGunes Bayir
2023-04-17Fix unhandled case in ElementwiseUnaryRamy Elgammal
2023-04-14Align naming convention of ClMatMulJakub Sujak
2023-04-14Fix dynamic weights for CPU connected layerViet-Hoa Do
2023-04-13Implement MatMul Function and Operator with Floating Point support for CPUMohammed Suhail Munshi
2023-04-11Enable quantized data types for CpuElementwiseUnary on Armv7aRamy Elgammal
2023-04-04Support dynamic weights for Fully Connected layers on GPUJakub Sujak
2023-04-03Implement MatMul FunctionRamy Elgammal
2023-04-03Add Cropping to CLBatchToSpaceOmar Al Khatib
2023-03-30Add cropping support to NEBatchToSpaceSiCong Li
2023-03-29Fix GCC13 compiler errorsPablo Marquez Tello
2023-03-29Add quantized support for unary elementwise in CPUViet-Hoa Do
2023-03-27Fix BatchToSpaceFixtureSiCong Li
2023-03-24Add Texture Pipe Support for Matmul Lhs T/NT Rhs T kernelsRamy Elgammal
2023-03-24Work around CLScale compiler-specific issueSiCong Li
2023-03-24Add Texture Pipe Support for Matmul Lhs T/NT Rhs NT kernelsGunes Bayir
2023-03-23Round to nearest with ties to away from zero in ReluPablo Marquez Tello
2023-03-21gemm_interleaved: Set up the accumulation buffer properly in alternateDavid Mansell
2023-03-21Add dynamic weights for CPU fully connected layerViet-Hoa Do
2023-03-20Implement OpenCL MatMul for Lhs T Rhs T/NT FP32/16Gunes Bayir
2023-03-17Implementation of RSQRT for quantized int8Ramy Elgammal
2023-03-17Implement OpenCL MatMul for Lhs NT Rhs T/NT FP32/16Ramy Elgammal
2023-03-14Add CropInfo to BatchToSpace reference and fixtureSiCong Li
2023-03-13arm_gemm: Add SME2 FP16 kernels.David Mansell
2023-03-13[ONCPUML-1174] Allow src/weights mismatch for fixed formatJonathan Deakin
2023-03-08Add support for arbitrary parameters for CPU GatherViet-Hoa Do
2023-03-07Add sigmoid and tanh for dynamic fusionViet-Hoa Do
2023-03-07GEMM: SME: Allow threading for quantized GEMMs.David Mansell
2023-03-07Resolve the presence of variables that are unused in release mode in Dynamic ...Omar Al Khatib