path: root/examples/gemm_tuner
Age | Commit message | Author
2019-11-12 | COMPMID-2690 Extend Doxygen documents to include GEMM Tuner | SiCong Li
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Change-Id: I998210c3c454c091cfe124f1151f0e052c83a0ef
    Reviewed-on: https://review.mlplatform.org/c/2264
    Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
2019-11-12 | COMPMID-2563 Change how the best overall GEMM configuration is selected | SiCong Li
    * Based on a specified tolerance, each GEMMParam (GEMM Shape) now can have a set of best GEMM configurations, instead of just a single one. This improves the robustness and completeness of the tuned results, and is in compliance with how we define the GEMMParam archetypes (the main goal of this story)
    * The tuner then tries to find the best overall GEMMConfig, from all the best config sets, through the same voting mechanism: the config that gets voted the most is the best overall GEMMConfig.
    Change-Id: Ief770bb6ffc04629d91f1dc778eea69274e007f0
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Reviewed-on: https://review.mlplatform.org/c/2228
    Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
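The selection scheme described in COMPMID-2563 can be illustrated with a minimal Python sketch. It is not the tuner's actual code: the function names, the data layout (a list of `(config, time_ms)` pairs per GEMM shape) and the reading of "tolerance" as a relative margin over the fastest time are all assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Hashable, List, Set, Tuple

# Hypothetical data layout: one list of (config, time_ms) pairs per GEMM shape.
Timings = List[Tuple[Hashable, float]]


def best_config_set(timings: Timings, tolerance: float) -> Set[Hashable]:
    """Return the configs whose time is within `tolerance` of the fastest one.

    Assumption: tolerance is relative, so 0.05 means "at most 5% slower".
    """
    fastest = min(time_ms for _, time_ms in timings)
    limit = fastest * (1.0 + tolerance)
    return {config for config, time_ms in timings if time_ms <= limit}


def best_overall_config(measurements: Dict[Hashable, Timings],
                        tolerance: float) -> Hashable:
    """Let every shape vote once for each config in its best set.

    The config with the most votes is returned. Ties are not resolved in any
    principled way in this sketch, and `measurements` must not be empty.
    """
    votes: Counter = Counter()
    for timings in measurements.values():
        votes.update(best_config_set(timings, tolerance))
    return votes.most_common(1)[0][0]


# Made-up timings for three shapes (M, N, K) and three configs.
measurements = {
    (64, 64, 64): [("cfgA", 1.00), ("cfgB", 1.04), ("cfgC", 1.50)],
    (128, 128, 64): [("cfgA", 2.20), ("cfgB", 2.00), ("cfgC", 2.50)],
    (256, 64, 32): [("cfgA", 3.00), ("cfgB", 2.90), ("cfgC", 2.95)],
}
print(best_overall_config(measurements, tolerance=0.05))  # cfgB
```

With a 5% tolerance, `cfgB` appears in the best set of all three shapes and collects three votes, against two for `cfgA` and one for `cfgC`. With a tolerance of zero each shape votes only for its single fastest config, which corresponds to the earlier behaviour the commit message contrasts against.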
2019-10-23 | COMPMID-2567: Create a python script to parse the CLGEMM benchmarks and return the optimal configuration | SiCong Li
    * Fix GEMM Reshaped example by adding the reshape lhs kernel.
    * Extend runner shell script to save result files with a defined file extension
    * Extend runner shell script to print out progress and time
    * Add python script
    * Update README
    Change-Id: I484ec8945aded4341743bc1024820523392b8ce3
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Reviewed-on: https://review.mlplatform.org/c/2122
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
    Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
2019-09-30 | COMPMID-2564 - Add CLGEMMNative example | SiCong Li
    * Add gemm native example.
    * Update shell script to run gemm native example.
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Change-Id: I0252f556b6f94adb776b8cb84b45971f1eb317e4
    Reviewed-on: https://review.mlplatform.org/c/1992
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2019-09-27 | COMPMID-2565 - Add CLGEMMReshaped example | SiCong Li
    * Add gemm reshaped example.
    * Update shell script to run gemm reshaped example.
    * Use OPENCL_TIMER_MS for instrument
    * Increase number of iterations to 5
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Change-Id: I41b3cebc5ee6368bf156d0e888101a3ff976a1b5
    Reviewed-on: https://review.mlplatform.org/c/1990
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2019-09-27 | COMPMID-2692 Use existing command line infrastructure for CLGEMMReshapedOnlyRHS | SiCong Li
    * Refactor to use existing command line infrastructure.
    * Fix: Remove erroneous initialisation code for dst matrix.
    * Fix: Correctly set bias tensor to be a vector.
    Change-Id: I787bfa08392df806aba3b0be09bab015f16010f7
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Reviewed-on: https://review.mlplatform.org/c/1985
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
    Reviewed-by: Georgios Pinitas <georgios.pinitas@arm.com>
2019-09-25 | COMPMID-2566 - Add CLGEMMReshapedOnlyRHS example | SiCong Li
    * Add example
    * Add shell script for running example on predefined tunable configs
    Change-Id: I1660c37a071492b4da5cf97c1ced5ac4a08b19ce
    Signed-off-by: SiCong Li <sicong.li@arm.com>
    Reviewed-on: https://review.mlplatform.org/c/1961
    Comments-Addressed: Arm Jenkins <bsgcomp@arm.com>
    Reviewed-by: Gian Marco Iodice <gianmarco.iodice@arm.com>
    Tested-by: Arm Jenkins <bsgcomp@arm.com>