Abstract
Eigenvalue problems involving very large sparse matrices arise in many fields of science. The numerical core of most iterative eigenvalue algorithms is a matrix-vector multiplication (MVM) involving the large sparse matrix. We present three different programming approaches to parallel sparse MVM on present-day supercomputers. In addition to a pure message-passing approach, we introduce two hybrid parallel implementations based on the simultaneous use of message-passing and shared-memory programming models. For a modern SMP cluster (HITACHI SR8000), we discuss the performance and scalability of the hybrid implementations and compare them with the pure message-passing approach on massively parallel systems (CRAY T3E), vector computers (NEC SX5e), and distributed shared-memory systems (SGI Origin3800).
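As a rough illustration of the numerical core the abstract refers to, the following is a minimal sketch of a sparse MVM kernel, assuming compressed row storage (CRS) and OpenMP threading inside each MPI process. This is not the authors' implementation; the storage format and all identifiers (crs_matrix, spmvm, row_ptr, col_idx, val) are illustrative assumptions.

/*
 * Hedged sketch (not the paper's code): y = A*x with A in compressed
 * row storage (CRS), thread-parallel via OpenMP within one MPI process.
 * All type and variable names are illustrative.
 */
typedef struct {
    int     nrows;     /* number of locally stored rows                  */
    int    *row_ptr;   /* nonzeros of row i are row_ptr[i]..row_ptr[i+1]-1 */
    int    *col_idx;   /* column index of each nonzero                   */
    double *val;       /* value of each nonzero                          */
} crs_matrix;

/* x must already contain any remote ("halo") entries received via MPI */
void spmvm(const crs_matrix *A, const double *x, double *y)
{
    #pragma omp parallel for schedule(static)
    for (int i = 0; i < A->nrows; ++i) {
        double sum = 0.0;
        for (int j = A->row_ptr[i]; j < A->row_ptr[i + 1]; ++j)
            sum += A->val[j] * x[A->col_idx[j]];
        y[i] = sum;
    }
}

In a hybrid setting such as the one the abstract describes, each MPI process would first exchange the remote entries of x with its neighbors (e.g., via nonblocking point-to-point communication) and then call the threaded kernel, whereas a pure message-passing version would run one single-threaded MPI process per processor. The details of the paper's three approaches are in the full text.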
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
Cite this paper
Wellein, G., Hager, G., Basermann, A., Fehske, H. (2003). Fast Sparse Matrix-Vector Multiplication for TeraFlop/s Computers. In: Palma, J.M.L.M., Sousa, A.A., Dongarra, J., Hernández, V. (eds) High Performance Computing for Computational Science — VECPAR 2002. VECPAR 2002. Lecture Notes in Computer Science, vol 2565. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36569-9_18
Print ISBN: 978-3-540-00852-1
Online ISBN: 978-3-540-36569-3