aboutsummaryrefslogtreecommitdiff
path: root/docs/user_guide
AgeCommit message (Expand)Author
4 daysAdd fp16 and integer data type support for ScatterNd in GpuGunes Bayir
9 daysAdd update/index/output (m+1)/2d/(m+n) support for CLScatterGunes Bayir
2024-04-17Update documentation for 24.04 releaseMichael Kozlov
2024-04-15Add s8f32 kernels and dynamic QuantizationInfoJonathan Deakin
2024-04-11Add SME2 implementation of softmax for FP16Gunes Bayir
2024-04-11Add in place summation to CPU GEMM kernelsRadu Salavat
2024-04-02Add SME2 implementation of softmax for FP32Viet-Hoa Do
2024-03-21[ONCPUML-1451] Add matmul kernel to enable bf16 to bf16 operations via PyTorc...Renato Arantes
2024-03-12Optimize CpuSoftmaxKernel for axis != 0 and neon kernelsOmar Al Khatib
2024-03-12Update documentation for 24.02.1 releaseFelix Thomasmathibalan
2024-03-11Prefer indirect Gemm vs. Direct convolution if supportedGunes Bayir
2024-02-21Integrate new pretranspose_b_array with extra fused transpose of BGunes Bayir
2024-02-12Update documentation for 24.02 releaseFelix Thomasmathibalan
2024-02-07Parallelize CPU depthwise over batch if only 1 rowJonathan Deakin
2024-02-02Add scripts to generate Doxygen documentationJakub Sujak
2024-01-18Improved documentationPablo Marquez Tello
2024-01-10Use look up table for fp16 activationMohammed Suhail Munshi
2023-12-22Add Mali™-G720 and Mali™-G620 as GpuTargetsGunes Bayir
2023-12-07Optimize CPU depth-to-spaceViet-Hoa Do
2023-12-05Optimize CpuSoftmaxKernel for axis=0Gunes Bayir
2023-11-28Changes to enable FP16 in armv8a multi_isaPablo Marquez Tello
2023-11-23Remove the legacy core libraryJakub Sujak
2023-11-14Update Release notes for 23.11Anitha Raj
2023-11-10Update list of supported operators in documentationJakub Sujak
2023-11-08Document how to build ACL with LLVM+Clang toolchainGunes Bayir
2023-11-08Optimize CpuGemmConv2d start-up timeSiCong Li
2023-11-01Add support for Arm® Cortex®-A520 and Arm® Cortex®-R82Viet-Hoa Do
2023-10-31[GPU] Update Reverse layer to allow negative axis and reversed axis orderAdnan AlSinan
2023-10-31Extend CKW MatMul with nt_tAdnan AlSinan
2023-10-31Optimize CL softmaxViet-Hoa Do
2023-10-11Optimize CL reduction operationViet-Hoa Do
2023-10-10Optimize NEStackLayerGunes Bayir
2023-10-05Optimize CLTranspose operatorJakub Sujak
2023-10-04Port DepthwiseConv2d operator to Ckwramy.elgammal@arm.com
2023-09-27Implement tflite compliant reverse for CPUAdnan AlSinan
2023-09-18Add CL command buffer classViet-Hoa Do
2023-09-15Remove deprecated support for BF16 in CpuCastAdnan AlSinan
2023-09-14Add skeleton of ClMatMulLowpNativeMMULKernelGunes Bayir
2023-09-04Remove legacy PostOps codeJakub Sujak
2023-08-31Port ClTemplatePool2d to ckwAdnan AlSinan
2023-08-30Port Resize operator to CKWGunes Bayir
2023-08-22CPU: Depthwise: Generate correct size for input indirection array.David Mansell
2023-08-22Optimize CpuReshapeKernelAnitha Raj
2023-08-15Update errataramy.elgammal@arm.com
2023-08-14Update OpenCL headers to v2023.04.17Viet-Hoa Do
2023-08-11Update Readme for 23.08ramy.elgammal@arm.com
2023-08-10Update Release Notesramy.elgammal@arm.com
2023-08-08Add support for S64 output in NEArgMinMaxLayerPablo Marquez Tello
2023-08-07Port DirectConv2d to CKW backendJakub Sujak
2023-08-07Document the Conv2D heuristicGian Marco Iodice