HPL Performance Prevision to Intending System Improvement

Wenli Zhang, Mingyu Chen & Jianping Fan

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3358)

Included in the following conference series:

International Symposium on Parallel and Distributed Processing and Applications

Abstract

HPL is a parallel Linpack benchmark package widely adopted for performance testing of massive cluster systems. Based on an in-depth analysis of the blocked parallel algorithm for solving systems of linear equations and of its implementation in HPL, a rule is found for determining the block size NB of the HPL data layout across processors theoretically, which removes the dependence on trial-and-error experiments. According to that rule, a simulation model is constructed to roughly estimate HPL execution time. After verification on a real system, the model is used to predict the benefit to Linpack results of intended system improvements, such as an increase in memory size or an increase in communication bandwidth, each considered separately. The predictions are expected to help guide future system improvements aimed at optimizing HPL results.
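The kind of estimate the abstract describes can be illustrated with a minimal sketch. This is not the authors' model, which this page does not reproduce. It assumes only the leading-order operation count of LU factorization (2/3 N^3), an N x N matrix of 8-byte doubles, and a simple bandwidth-bound communication term. The function names, the parameters, and the communication volume formula are all hypothetical choices made for illustration.

```python
import math


def max_problem_size(total_mem_bytes, fill_fraction=0.8, nb=128):
    """Largest N (rounded down to a multiple of nb) whose N x N matrix of
    8-byte doubles fits in fill_fraction of the total memory.
    fill_fraction and nb are illustrative defaults, not values from the paper."""
    usable_doubles = int(fill_fraction * total_mem_bytes / 8)
    n = math.isqrt(usable_doubles)
    return (n // nb) * nb


def estimate_hpl_time(n, p, q, flops_per_proc, bandwidth_bytes_per_s):
    """Rough execution-time estimate on a p x q process grid.
    Compute term: leading-order LU operation count, evenly divided.
    Communication term: ASSUMED per-process volume of n^2 (3p + q) / (2pq)
    doubles, transferred at the given bandwidth; latency is ignored."""
    compute = (2.0 / 3.0) * n**3 / (p * q * flops_per_proc)
    comm_bytes = 8.0 * n**2 * (3 * p + q) / (2.0 * p * q)
    return compute + comm_bytes / bandwidth_bytes_per_s


if __name__ == "__main__":
    # Hypothetical cluster: 16 processes, 1 GiB and 2 Gflop/s each.
    mem = 16 * 1024**3
    for label, m, bw in [("baseline", mem, 100e6),
                         ("2x memory", 2 * mem, 100e6),
                         ("2x bandwidth", mem, 200e6)]:
        n = max_problem_size(m)
        t = estimate_hpl_time(n, 4, 4, 2e9, bw)
        gflops = (2.0 / 3.0) * n**3 / t / 1e9
        print(f"{label}: N={n}, time={t:.0f} s, rate={gflops:.1f} Gflop/s")
```

Under these assumptions, more memory helps by allowing a larger N, which raises the ratio of computation to communication, while more bandwidth shrinks the communication term directly for a fixed N.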

This research is supported by the Chinese National High-tech Research and Development (863) Program (grant 2003AA 1Z2070) and by the foundation of the Knowledge Innovation Program (grant 20036040) of the Chinese Academy of Sciences (CAS).

Author information

Authors and Affiliations

National Research Center for Intelligent Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences,
Wenli Zhang, Mingyu Chen & Jianping Fan
Graduate School of the Chinese Academy of Sciences, NCIC, P.O. Box 2704, Beijing, 100080, P.R. China
Wenli Zhang

Editor information

Editors and Affiliations

Department of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong, China
Jiannong Cao
Department of Computer Science, St. Francis Xavier University, Antigonish, Canada
Laurence T. Yang
Department of Computer Science and Engineering, Shanghai Jiao Tong University, 200030, Shanghai, China
Minyi Guo
Department of Computer Science, The University of Hong Kong, Pokfulam
Francis Lau

About this paper

Cite this paper

Zhang, W., Chen, M., Fan, J. (2004). HPL Performance Prevision to Intending System Improvement. In: Cao, J., Yang, L.T., Guo, M., Lau, F. (eds) Parallel and Distributed Processing and Applications. ISPA 2004. Lecture Notes in Computer Science, vol 3358. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30566-8_90

DOI: https://doi.org/10.1007/978-3-540-30566-8_90
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24128-7
Online ISBN: 978-3-540-30566-8
eBook Packages: Computer Science, Computer Science (R0)
