Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3184407.3184409acmconferencesArticle/Chapter ViewAbstractPublication PagesicpeConference Proceedingsconference-collections
short-paper

Energy and Performance Analysis of Parallel Particle Solvers from the ScaFaCoS Library

Published: 30 March 2018 Publication History

Abstract

Performance analysis in high performance computing (HPC) has traditionally focused on improving application programs, for example, by decreasing the overall runtime or increasing the throughput of floating point operations. However, the same approaches might also be used to influence the energy behavior. Since the increasing energy consumption of HPC platforms is gaining more and more attention, the identification of applications and platforms for which energy and performance measurement lead to differing results is of great importance. In this article, we analyze the energy and performance behavior of particle solvers from the ScaFaCoS library. Four different criteria are investigated with respect to their influence on the energy consumption and achieved performance. These criteria are the solution method chosen, the parameters of the solution method, the degree of parallelism, and the parameters of the hardware platform.

References

[1]
A. Abedi and T. Brecht. 2017. Conducting repeatable experiments in highly variable cloud computing environments Int. Conf. on Performance Engineering (ICPE'17). ACM, 287--292.
[2]
J.I. Aliaga, M. Barreda, M.F. Dolz, and E.S. Quintana-Ort'ı. 2015. Are our dense linear algebra libraries energy-friendly? Computer Science-Research and Development Vol. 30, 2 (2015), 187--196.
[3]
A. Arnold. 2011. Fourier transformed-based methods for long-range interactions: Ewald, P$^3$M and more. Fast Methods for Long-Range Interactions in Complex Systems. IAS Series, Vol. Vol. 6. Forschungszentrum Jülich, 39--64.
[4]
A. Arnold, F. Fahrenberger, C. Holm, O. Lenz, M. Bolten, H. Dachsel, R. Halver, I. Kabadshow, F. Gahler, F. Heber, J. Iseringhausen, M. Hofmann, M. Pippig, D. Potts, and G. Sutmann. 2013. Comparison of scalable fast methods for long-range interactions. Physical Review E Vol. 88 (2013), 063308. Issue 6.
[5]
J. Barnes and P. Hut. 1986. A hierarchical $O(N łog N)$ force-calculation algorithm. Nature, Vol. 324, 6096 (1986), 446--449.
[6]
S. Browne, J. Dongarra, N. Garner, G. Ho, and P. Mucci. 2000. A portable programming interface for performance evaluation on modern processors. Int. J. of High Performance Computing Applications, Vol. 14, 3 (2000), 189--204.
[7]
J. Carretero, S. Distefano, D. Petcu, D. Pop, T. Rauber, G. Rünger, and D.E. Singh. 2015. Energy-efficient algorithms for ultrascale systems. Supercomputing Frontiers and Innovations Vol. 2, 2 (2015), 77--104.
[8]
J.W. Choi and R.W. Vuduc. 2016. Analyzing the energy efficiency of the fast multipole method using a DVFS-aware energy model. In Int. Parallel and Distributed Processing Symposium Workshops (IPDPSW'16). IEEE, 79--88.
[9]
H. Dachsel, M. Hofmann, J. Lang, and G. Rünger. 2012. Automatic tuning of the fast multipole method based on integrated performance prediction Int. Conf. on High Performance Computing and Communication (HPCC'12). IEEE, 617--624.
[10]
M. Etinski, J. Corbalán, J. Labarta, and M. Valero. 2012. Understanding the future of energy-performance trade-off via DVFS in HPC environments. J. of Parallel and Distributed Computing Vol. 72, 4 (2012), 579--590.
[11]
L. Greengard and V. Rokhlin. 1987. A fast algorithm for particle simulations. J. of Computational Physics Vol. 73 (1987), 325--348.
[12]
T. Jakobs, J. Lang, G. Rünger, and P. Stöcker. 2017. Tuning linear algebra for energy efficiency on multicore machines by adapting the ATLAS library. Future Generation Computer Systems (2017). (to appear).
[13]
E.A. León, I. Karlin, R.E. Grant, and M. Dosanjh. 2016. Program optimizations: The interplay between power, performance, and energy. Parallel Comput. Vol. 58 (2016), 56--75.
[14]
D. Li, B.R. de Supinski, M. Schulz, K. Cameron, and D.S. Nikolopoulos. 2010. Hybrid MPI/OpenMP power-aware computing. In Int. Symposium on Parallel Distributed Processing (IPDPS 2010). IEEE, 1--12.
[15]
M. Pippig and D. Potts. 2013. Parallel three-dimensional nonequispaced fast Fourier transforms and their application to particle simulation. SIAM J. on Scientific Computing Vol. 35, 4 (2013), C411--C437.
[16]
A. Podzimek, L. Bulej, L.Y. Chen, W. Binder, and P. Tuma. 2015. Analyzing the impact of CPU pinning and partial CPU loads on performance and energy efficiency. In Int. Symposium on Cluster, Cloud and Grid Computing (CCGrid'15). IEEE, 1--10.
[17]
M. Puzoviç, S. Manne, S. GalOn, and M. Ono. 2016. Quantifying energy use in dense shared memory HPC node Int. Workshop on Energy Efficient Supercomputing (E2SC 2016). IEEE, 16--23.
[18]
T. Rauber and G. Rünger. 2015. Modeling and analyzing the energy consumption of fork-join-based task parallel programs. Concurrency and Computation: Practice and Experience, Vol. 27, 1 (2015), 211--236.
[19]
T. Rauber, G. Rünger, and M. Schwind. 2014. Energy measurement and prediction for multi-threaded programs High Performance Computing Symposium (HPC 2014). Society for Computer Simulation International, 20:1--20:9.
[20]
T. Rauber, G. Rünger, M. Schwind, H. Xu, and S. Melzner. 2014. Energy measurement, modeling, and prediction for processors with frequency scaling. J. of Supercomputing, Vol. 70, 3 (2014), 1451--1476.
[21]
L. Tan, S. Kothapalli, L. Chen, O. Hussaini, R. Bissiri, and Z. Chen. 2014. A survey of power and energy efficient techniques for high performance numerical linear algebra operations. Parallel Comput. Vol. 40, 10 (2014), 559--573.
[22]
S. Wang, B. Luo, W. Shi, and D. Tiwari. 2016. Application configuration selection for energy-efficient execution on multicore systems. J. of Parallel and Distributed Computing Vol. 87 (2016), 43--54.
[23]
V.M. Weaver, D. Terpstra, H. McCraw, M. Johnson, K. Kasichayanula, J. Ralph, J. Nelson, P. Mucci, T. Mohan, and S. Moore. 2013. PAPI 5: Measuring power, energy, and the cloud Int. Symposium on Performance Analysis of Systems and Software (ISPASS'13). IEEE, 124--125.

Cited By

View all
  • (2020)A performance- and energy-oriented extended tuning process for time-step-based scientific applicationsThe Journal of Supercomputing10.1007/s11227-020-03402-yOnline publication date: 25-Aug-2020
  • (2019)Multiprocessor Task Programming and Flexible Load Balancing for Time-Stepping Methods on Heterogeneous Cloud Infrastructures2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI)10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00277(1537-1544)Online publication date: Aug-2019

Index Terms

  1. Energy and Performance Analysis of Parallel Particle Solvers from the ScaFaCoS Library

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICPE '18: Proceedings of the 2018 ACM/SPEC International Conference on Performance Engineering
    March 2018
    328 pages
    ISBN:9781450350952
    DOI:10.1145/3184407
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 March 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. energy consumption
    2. particle simulations
    3. performance analysis

    Qualifiers

    • Short-paper

    Funding Sources

    • German Ministry of Science and Education (BMBF)

    Conference

    ICPE '18

    Acceptance Rates

    Overall Acceptance Rate 252 of 851 submissions, 30%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2020)A performance- and energy-oriented extended tuning process for time-step-based scientific applicationsThe Journal of Supercomputing10.1007/s11227-020-03402-yOnline publication date: 25-Aug-2020
    • (2019)Multiprocessor Task Programming and Flexible Load Balancing for Time-Stepping Methods on Heterogeneous Cloud Infrastructures2019 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/CBDCom/IOP/SCI)10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00277(1537-1544)Online publication date: Aug-2019

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media