Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2916026.2916028acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article

Autotuning of MPI Applications Using PTF

Published: 31 May 2016 Publication History

Abstract

The main problem when trying to optimize the parameters of libraries, such as MPI, is that there are many parameters that users can configure. Moreover, predicting the behavior of the library for each configuration is non-trivial. This makes it very difficult to select good values for these parameters. This paper proposes a model for autotuning MPI applications. The model is developed to analyze different parameter configurations and is expected to aid users to find the best performance for executing their applications. As part of the AutoTune project, our work is ultimately aiming at extending Periscope to apply automatic tuning to parallel applications. In particular, our objective is to provide a straightforward way of tuning MPI parallel codes. The output of the framework are tuning recommendations that can be integrated into the production version of the code. Experimental tests demonstrate that this methodology could lead to significant performance improvements.

References

[1]
Autotune. Periscope Tuning Framework, March, 2016. http://periscope.in.tum.de.
[2]
S. Benedict, V. Petkov, and M. Gerndt. PERISCOPE: An Online-Based Distributed Performance Analysis Tool. In M. S. Müller, M. M. Resch, A. Schulz, and W. E. Nagel, editors, Tools for High Performance Computing 2009, pages 1--16. Springer Berlin Heidelberg, 2010. 10.1007/978-3-642-11261-4-1.
[3]
P. Caymes-Scutari, A. Morajko, T. Margalef, and E. Luque. Scalable dynamic Monitoring, Analysis and Tuning Environment for parallel applications. J. Parallel Distrib. Comput., 70(4):330--337, 2010.
[4]
M. Chaarawi, J. M. Squyres, E. Gabriel, and S. Feki. A Tool for Optimizing Runtime Parameters of Open MPI. In A. L. Lastovetsky, M. T. Kechadi, and J. Dongarra, editors, PVM/MPI, volume 5205 of Lecture Notes in Computer Science, pages 210--217. Springer, 2008.
[5]
I.-H. Chung and J. K. Hollingsworth. Using Information from Prior Runs to Improve Automated Tuning Systems. In Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, SC '04, pages 30--, Washington, DC, USA, 2004. IEEE Computer Society.
[6]
I. Corporation. Intel® MPI Library. Reference Manual for Linux* OS, 2003--2014.
[7]
G. Fursin, Y. Kashnikov, A. Memon, Z. Chamski, O. Temam, M. Namolaru, E. Yom-Tov, B. Mendelson, A. Zaks, E. Courtois, F. Bodin, P. Barnard, E. Ashton, E. Bonilla, J. Thomson, C. Williams, and M. O'Boyle. Milepost GCC: Machine Learning Enabled Self-tuning Compiler. International journal of parallel programming, 39(3):296--327, 2011.
[8]
M. Haneda, P. Knijnenburg, and H. Wijshoff. Automatic selection of compiler options using non-parametric inferential statistics. In Parallel Architectures and Compilation Techniques, 2005. PACT 2005. 14th International Conference on, pages 123--132, Sept 2005.
[9]
IBM MPI: An MPI implementation for SuperMUC https://www.lrz.de/services/software/parallel/mpi/ibmmpi/.
[10]
S. Kukkonen and J. Lampinen. GDE3: The third evolution step of generalized differential evolution. In Evolutionary Computation, 2005. The 2005 IEEE Congress on, volume 1, pages 443--450. IEEE, 2005.
[11]
R. Miceli, G. Civario, A. Sikora, E. César, M. Gerndt, H. Haitof, C. Navarrete, S. Benkner, M. Sandrieser, L. Morin, and F. Bodin. AutoTune: A Plugin-Driven Approach to the Automatic Tuning of Parallel Applications. In P. Manninen and P. Öster, editors, Applied Parallel and Scientific Computing, volume 7782 of Lecture Notes in Computer Science, pages 328--342. Springer Berlin Heidelberg, 2013.
[12]
Z. Pan and R. Eigenmann. Fast and Effective Orchestration of Compiler Optimizations for Automatic Performance Tuning. In Proceedings of the International Symposium on Code Generation and Optimization, CGO '06, pages 319--332, Washington, DC, USA, 2006. IEEE Computer Society.
[13]
S. Pellegrini, J. Wang, T. Fahringer, and H. Moritsch. Optimizing MPI Runtime Parameter Settings by Using Machine Learning. In Recent Advances in Parallel Virtual Machine and Message Passing Interface, 16th European PVM/MPI Users' Group Meeting, Espoo, Finland, September 7--10, 2009. Proceedings, pages 196--206, 2009.
[14]
M. Püschel, J. M. F. Moura, B. Singer, J. Xiong, J. Johnson, D. Padua, M. Veloso, and R. W. Johnson. Spiral: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms. Int. J. High Perform. Comput. Appl., 18(1):21--45, Feb. 2004.
[15]
R. Solar, R. Suppi, and E. Luque. High performance distributed cluster-based individual-oriented fish school simulation. In ICCS, pages 76--85, 2011.
[16]
A. Tiwari, C. Chen, J. Chame, M. Hall, and J. Hollingsworth. A scalable auto-tuning framework for compiler optimization. In Parallel Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on, pages 1--12, May 2009.
[17]
S. Triantafyllis, M. Vachharajani, N. Vachharajani, and D. I. August. Compiler Optimization-space Exploration. In Proceedings of the International Symposium on Code Generation and Optimization: Feedback-directed and Runtime Optimization, CGO '03, pages 204--215, Washington, DC, USA, 2003. IEEE Computer Society.
[18]
R. Vuduc, J. W. Demmel, and K. A. Yelick. Oski: A library of automatically tuned sparse matrix kernels. Journal of Physics: Conference Series, 16(1):521, 2005.
[19]
R. C. Whaley, A. Petitet, and J. J. Dongarra. Automated empirical optimizations of software and the ATLAS project. Parallel Computing, 27(1--2):3--35, 2001. New Trends in High Performance Computing.

Cited By

View all
  • (2023)A Statistical Analysis of HPC Network TuningProceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624114(458-465)Online publication date: 12-Nov-2023
  • (2021)The MPI Tool Interfaces: Past, Present, and Future—Capabilities and ProspectsTools for High Performance Computing 2018 / 201910.1007/978-3-030-66057-4_3(55-83)Online publication date: 23-May-2021
  • (2020)Predicting MPI Collective Communication Performance Using Machine Learning2020 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER49012.2020.00036(259-269)Online publication date: Sep-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SEM4HPC '16: Proceedings of the ACM Workshop on Software Engineering Methods for Parallel and High Performance Applications
May 2016
54 pages
ISBN:9781450343510
DOI:10.1145/2916026
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. autotuning
  2. mpi
  3. ptf
  4. runtime parameters

Qualifiers

  • Research-article

Funding Sources

  • EU Horizon 2020 project RADEX
  • MINECO-Spain
  • GenCat-DIUiE(GRR)
  • EU FP7 project AutoTune

Conference

HPDC'16
Sponsor:

Acceptance Rates

SEM4HPC '16 Paper Acceptance Rate 5 of 11 submissions, 45%;
Overall Acceptance Rate 8 of 16 submissions, 50%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A Statistical Analysis of HPC Network TuningProceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis10.1145/3624062.3624114(458-465)Online publication date: 12-Nov-2023
  • (2021)The MPI Tool Interfaces: Past, Present, and Future—Capabilities and ProspectsTools for High Performance Computing 2018 / 201910.1007/978-3-030-66057-4_3(55-83)Online publication date: 23-May-2021
  • (2020)Predicting MPI Collective Communication Performance Using Machine Learning2020 IEEE International Conference on Cluster Computing (CLUSTER)10.1109/CLUSTER49012.2020.00036(259-269)Online publication date: Sep-2020
  • (2020)A performance- and energy-oriented extended tuning process for time-step-based scientific applicationsThe Journal of Supercomputing10.1007/s11227-020-03402-yOnline publication date: 25-Aug-2020
  • (2019)Auto-tuning methodology for configuration and application parameters of hybrid CPU + GPU parallel systems based on expert knowledge2019 International Conference on High Performance Computing & Simulation (HPCS)10.1109/HPCS48598.2019.9188060(551-558)Online publication date: Jul-2019
  • (2018)Cooperative rendezvous protocols for improved performance and overlapProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.5555/3291656.3291694(1-13)Online publication date: 11-Nov-2018
  • (2018)Dynamic Tuning of OpenMP Memory Bound Applications in Multisocket Systems using MATEWorkshop Proceedings of the 47th International Conference on Parallel Processing10.1145/3229710.3229748(1-10)Online publication date: 13-Aug-2018
  • (2018)Autotuning MPI Collectives using Performance GuidelinesProceedings of the International Conference on High Performance Computing in Asia-Pacific Region10.1145/3149457.3149461(64-74)Online publication date: 28-Jan-2018
  • (2018)Cooperative rendezvous protocols for improved performance and overlapProceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis10.1109/SC.2018.00031(1-13)Online publication date: 11-Nov-2018
  • (2017)MPI performance engineering with the MPI tool interfaceProceedings of the 24th European MPI Users' Group Meeting10.1145/3127024.3127036(1-11)Online publication date: 25-Sep-2017
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media