aboutsummaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2018-11-02COMPMID-556: Rename CPP folder to referenceGeorgios Pinitas
Change-Id: I147644349547c4e3804a80b564a9ad95131ad2d0 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111560 Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02IVGCVSW-656 : added support (and unit tests) for asymmetric padding inJaroslaw Rzepecki
dirct conv (CL) Change-Id: I4b8389376e675bfa93b4a1ae7c8e65b8db1f4c4b Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111102 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-715 Shrink window to size 0 if static and not resizableGiorgio Arena
Change-Id: Ifddf920bd5b8057a5a072ad64c68c9df2ba1c93c Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111018 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-692 Consistent names for the interfacesGiorgio Arena
Change-Id: I4b1f3f0da9ff5342c7de7083736fe91871d14e5b Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110351 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-706 - Add GEMMLowp output stage for scaling by a fixed point numberGian Marco
DoD: - Implement NEON kernel for quantizing down the gemmlowp result. The result should be scaled by a fixedpoint number - Implement OpenCL kernel for quantizing down the gemmlowp result. The result should be scaled by a fixedpoint number - Add test for validating the result Required for: - Integration of GEMMLowp in Android NN - Convolution quantized - Fully connected quantized Change-Id: Ia963d25d695471e963961fb49a5600e78374ac4f Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110981 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-617: Add validate support for NEON PixelWiseMultiplicationIoan-Cristian Szabo
Change-Id: Ie81a4d667146315fed7668cf2ca752d3bf49b0ab Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111013 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02APPBROWSER-330: Workaround for the free unallocated value issue on AndroidJoel Liang
Added a default destructor in cpp file to workaround the free unallocated value issue on Android Change-Id: I3db84a0504a51b9b90823cbb37aa1cace06870e3 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111438 Reviewed-by: Frank Lei <frank.lei@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02APPBROWSER-329: Fix the adjusting group size issueszhenglin
Change-Id: I2171b79e85eb316bf6bfb09ab8a54880b7708a4c Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111435 Reviewed-by: Stephen Li <stephen.li@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Joel Liang <joel.liang@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-687: Winograd layer.Pablo Tello
Change-Id: Ica682d08e851491bf4a26b8d17908c014844055e Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110990 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-617 Add validation window to CLActivationLayerGiorgio Arena
Change-Id: I56e0655a2f3448691cfd7bef6911d7959da12fdd Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111186 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-684: 2D In-Map normalization support for CLGeorgios Pinitas
Change-Id: I73a11ef3ff7265abce196b128413f54623d33cae Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111294 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Pablo Tello <pablo.tello@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-661: QASYMM8 support for fully connected layer.Georgios Pinitas
Change-Id: I70e04d3a175ba366432ada98e9ca893c9f81b260 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111094 Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-617: Add validate support for NEON BatchNormalizationLayer.Ioan-Cristian Szabo
Change-Id: I037ec6df7eee06bdd1381e908677803426fa614c Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110788 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPIMID-553: MobileNet use case.Georgios Pinitas
Change-Id: I1181abbd5785065f3d57e91844376a4b110938a9 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110701 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-695: Update Phase and Validation WrappingJohn Richardson
Simplify Phase reference implementation so that its results are more inline with the CL implementation (note: NEON uses a fast arctan approximation). Modify validate_wrap function to limit use to Integer types only. Change-Id: Ie4222568a8ef2587cab8e6d478745c5d0ded3d57 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110192 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
2018-11-02Fix border handler for CLDepthwiseConvolution3x3Diego Lopez Recas
Change-Id: Ibf855a8ff86750978ecb81441369c83bc766f143 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110987 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-701 Fixed std::bad_alloc in NEON DepthwiseConvolutionLayerMichalis Spyrou
Change-Id: I249749c38a5f96c0d1fdbc217acb435307627499 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111074 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-710 - Fix CLTranspose and NETranspose when the input shape is aGian Marco
vector Fixed replacing AccessWindowTranspose with AccessWindowStatic due to the wrong padding calculation in the X direction within AccessWindowTranspose. AccessWindowTranspose should be fixed with COMPMID-708 Change-Id: I665e130b897a213ae2bf9da4bf092dd491fa00c6 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111057 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Michel Iwaniec <michel.iwaniec@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02IVGCVSW-796 IVGCVSW-819 add NEON kernel U8 ReshapeLes Bell
Change-Id: I5cc348689f8fd3e3061e4a16c2540aab6da26639 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111160 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-713: Address failures in OCLGrind for CLDirectConvolutionGeorgios Pinitas
-Changes way of clamping in the kernel side. -Fills padding with quantized values Change-Id: I94d17c341fd637fbb24390722162b551b62d16cb Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111114 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-712: OCLGrind CLSoftmaxLayer quantized failuresGeorgios Pinitas
Change-Id: I480eb8ad55b632c7d75b1a89e952e77b0ebbeda5 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111158 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-617 Add validation window to BatchNormalization, ↵Giorgio Arena
PixelwiseMultiplication, ArithmeticOps Change-Id: I306bd23fcd9b7eb7a248dc762aae899b28300b90 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110763 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Ioan-Cristian Szabo <ioan-cristian.szabo@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-714: Resolve failures in OCLGrind for CLPhaseGeorgios Pinitas
Change-Id: I4b5150476839649e6c3005a54f01e0788519bfb1 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111101 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-661 Add Bifrost lws heuristics for several depthwise_convolution ↵Anthony Barbier
kernels #49 Change-Id: Ibfa1c1cc9fc8501b22a18ecd519758f4aeb301eb Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110880 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-617: Add validation to NEON functions.Georgios Pinitas
Adds validation to: - NECol2Im - NEIm2Col Change-Id: I346298583a6985ea793f71bb4527aa216a5cd4b2 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110835 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-556: Fix CLDepthwiseConvolution3x3 Kernel.Georgios Pinitas
Kernel was not sliding the input window. Change-Id: Ia5903ceaed1243e86bee773a84102d8a1132dfa5 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/111055 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-592: Ported GEMMInterleave4x4 tests.Pablo Tello
Change-Id: I161fa095b2b8a719bb152366294f62c1aad073ce Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110463 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
2018-11-02COMPMID-632: Integrated Assembly kernel GEMM U8 for Arm Cortex-A53.Michalis Spyrou
Change-Id: I053444f3cd4d0124df3a4a7aa8533b8395fb1336 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110659 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-617 Add validation window to CLNormalizationLayerGiorgio Arena
Change-Id: Iada46e5656bd6d16e5d965e2d38df9e60d5785c7 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110950 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-617 Add validation window to CLPoolingLayerGiorgio Arena
Change-Id: I1ef117399ff694c34178dd973458d52cd5ebf1f6 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110939 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-661 Add optimal lws heuristics for the Bifrost direct_convolution ↵Anthony Barbier
kernels #45 Change-Id: I9e7ec5ed937fb4e8cab44a11c49a93f3aa01bedb Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110877 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-697 - Rework GEMMLowp interface on OpenCLGian Marco
Reworked the interface of GemmLowp in order to make easy the integration in Android NN - Added support for different output stage - Added validation for both matrix multiplication and output stage - Added bounded relu support in the output stage - Added in32_t bias support - Added optimized path for vector by matrix case This rework is required for: - Convolution quantized - Fully connected quantized Change-Id: I512283d406099cf8c614dd89d0a97ed411143afc Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110625 Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-556 - Add QASYMM8 support for missing OpenCL kernelsGian Marco
- CLCol2ImKernel - CLGEMMIntereleave4x4Kernel - CLIm2ColKernel - CLTransposeKernel - CLWeightsReshapeKernel Change-Id: I2c6066f59f078cfe88ed0f3d2f61db137d375159 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110790 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-694: merge opencl_timer flag with the opencl flagAnthony Barbier
Is there any reason why we might have OpenCL enabled but wouldn't want to make the opencl_timer instrument available ? Change-Id: I946d643028fc394a607478415e4f84c4e8fa5bc4 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110466 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Kevin Petit <kevin.petit@arm.com> Reviewed-by: Ioan-Cristian Szabo <ioan-cristian.szabo@arm.com> Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Reviewed-by: Giorgio Arena <giorgio.arena@arm.com>
2018-11-02COMPMID-699 - Add NEON functions for im2col and col2imGian Marco
Change-Id: Ie4a5cbd42f412d28de92d787e955e7d977918371 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110737 Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-556: Fix lws_hint CLSoftmaxLayerGeorgios Pinitas
Change-Id: I15347bcfc68d76d47efa06b6036e1ed81548d227 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110678 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
2018-11-02COMPMID-632: Integrated Assembly kernel GEMM S8 for Arm Cortex-A53.Pablo Tello
Change-Id: I07faa0c984759a1b5db1e5de71f4ab3eef5888d8 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110334 Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-464 Implement Depthwise separable convolution on NEONMichalis Spyrou
Change-Id: Icdaf7c2666a7fbb9b3c32ba2d70b667a59a6eeb9 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110468 Reviewed-by: Pablo Tello <pablo.tello@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-554 Add NodesMichalis Spyrou
- DepthwiseConvolutionLayer Change-Id: Icaef85d7474f7532bf7d93d11b5c787712e57bdd Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110524 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-632 Assembly: Integrate gemmlowp assembly versionMichalis Spyrou
Integrate generic gemmlowp assembly version for u8. Change-Id: I17ed4494c25a132b2bac581febe1544e49b4f352 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110114 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Pablo Tello <pablo.tello@arm.com>
2018-11-02COMPMID-556 Improved indentation and error handling in format_doxygen.pyAnthony Barbier
Change-Id: I6f51ffe6c324d9da500716b52c97c344f2a2a164 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110486 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-556 Print tests list using printers rather than coutAnthony Barbier
Will help with scripting to split the tests to run across several boards Change-Id: I32806c3cd03cd1b4af1865cd4fdb0422d609eff0 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110535 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
2018-11-02COMPMID-556 Add saturation to 8-bit activation. This prevents undefined overflowRob Hughes
from occurring when an output value cannot be expressed in the output space Change-Id: I4045a44ee257a829f6cfe708dcbad37bb91c0ee4 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110393 Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-464 Implement Depthwise separable convolution on NEONMichalis Spyrou
Change-Id: Iccd686be18381e96bcf09b14c7017c6dda0f38d8 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/109824 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Pablo Tello <pablo.tello@arm.com>
2018-11-02COMPMID-661: Fix scale border issue (#38)Daniil Efremov
Change-Id: If1dcca724e5e5f5ab363ffc16b0ef8c943e0b657 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110105 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>
2018-11-02COMPMID-694: Fix condition which decides whether or not to profile a runAnthony Barbier
Change-Id: I4853a4d83a13b162d5759434a4e800c8f9e05cce Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110108 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2018-11-02COMPMID-556 - Passed _lws_hint in the run() method where was missingGian Marco
_lws_hint was missing in the run() method for some ML functions where _config_id was set Change-Id: I0a092372cccaf21f29e4c78340201de1606caab4 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110250 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-696: Integrated assembly SGEMM for Arm Cortex-A55 and Arm Cortex-A55r1.Pablo Tello
Change-Id: I9c8fa13d53c7310cacf3446faa42026fdf3396fa Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110209 Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com> Reviewed-by: Michalis Spyrou <michalis.spyrou@arm.com>
2018-11-02COMPMID-556: Set CLSoftmaxLayer temp buffers to resizable.Georgios Pinitas
Auto-padding configuration lead the clone of input tensor info to be not resizable in computer_vision. Explicitly sets the tensor info clones to resizable. Change-Id: I8ec3d430127e6ed5cca7884c844654b6e71b7ee5 Reviewed-on: https://eu-gerrit-1.euhpc.arm.com/110194 Reviewed-by: Anthony Barbier <anthony.barbier@arm.com> Tested-by: BSG Visual Compute Jenkins server to access repositories on http://mpd-gerrit.cambridge.arm.com <bsgcomp@arm.com>
2018-11-02COMPMID-677: Integrate HGEMM assembly kernel (generic CPUs)Pablo Tello
Change-Id: I39abf367fe7ea1a54475e2ac0ecec12e90806899 Reviewed-on: http://mpd-gerrit.cambridge.arm.com/95378 Tested-by: Kaizen <jeremy.johnson+kaizengerrit@arm.com> Reviewed-by: Anthony Barbier <anthony.barbier@arm.com>