Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications

Akihiro Musa^22,23,
Hiroyuki Takizawa²²,
Koki Okabe²²,
Takashi Soga²⁴ &
…
Hiroaki Kobayashi²²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4330))

Included in the following conference series:

International Symposium on Parallel and Distributed Processing and Applications

618 Accesses
4 Citations

Abstract

This paper examines the memory performance of the vector-parallel and scalar-parallel computing platforms across five applications of three scientific areas; electromagnetic analysis, CFD/heat analysis, and seismology. Our evaluation results show that the vector platforms can achieve the high computational efficiency and hence significantly outperform the scalar platforms in the areas of these applications. We did exhaustive experiments and quantitatively evaluated representative scalar and vector platforms using real applications from the viewpoint of the system designers and developers. These results demonstrate that the ratio of memory bandwidth to floating-point operation rate needs to reach 4-bytes/flop to preserve the computational performance with hiding the memory access latencies by pipelined vector operations in the vector platforms. We also confirm that the enough number of memory banks to handle stride memory accesses leads to an increase in the execution efficiency. On the scalar platforms, the cache hit rate needs to be almost 100% to achieve the high computational efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE

Article Open access 07 March 2017

SX-ACE, Brand-New Vector Supercomputer for Higher Sustained Performance I

The Brand-New Vector Supercomputer, SX-ACE

References

Shingu, S., et al.: A 26.58 Tflops Global Atmospheric Simulation with the Spectral Transform Method on the Earth Simulator. In: Proceedings of the ACM/IEEE SC 2002 conference (2002)
Google Scholar
Yokokawa, M., et al.: 16.4-Tflops Direct Numerical Simulation of Turbulence by a Fourier Spectral Method on the Earth. In: Proceedings of the ACM/IEEE SC 2002 conference (2002)
Google Scholar
Oliker, L., et al.: Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations. In: Proceedings of the ACM/IEEE SC 2003 conference (2003)
Google Scholar
Oliker, L., et al.: Scientific Computations on Modern Parallel Vector System. In: Proceedings of the ACM/IEEE SC 2004 conference (2004)
Google Scholar
Fatoohi, R.A.: Vector Performance Analysis of Three Supercomputers: Cray-2, Cray Y-MP, and ETA10-Q. In: Proceedings of Supercomputing 1989 (1989)
Google Scholar
Fatoohi, R.A.: Vector Performance Analysis of The NEC SX-2. In: Proceedings of Supercomputing 1990 (1990)
Google Scholar
Shan, H., et al.: Performance Characteristics of the Cray X1 and Their Implications for Application Performance Tuning. In: Proceedings of the ICS 2004 (2004)
Google Scholar
Kitagawa, K., et al.: A Hardware Overview of SX-6 and SX-7 Supercomputer. NEC Research & Development 44, 2–7 (2003)
Google Scholar
Senta, T., et al.: Itanium2 32-way Server System Architecture. NEC Research & Development 44, 8–12 (2003)
Google Scholar
Kobayashi, T., et al.: FDTD simulation on array antenna SAR-GPR for land mine detection. In: Proceeding of SSR 2003: 1st International Symposium on Systems and Human Science, Osaka, Japan, November 2003, pp. 279–283 (2003)
Google Scholar
Kunz, K.S., Luebbers, R.J.: The Finite Difference Time Domain Method for Electromagnetics. CRC Press, Boca Raton (1993)
Google Scholar
Takagi, Y., et al.: Study of High Gain and Broadband Antipodal Fermi Anenna with Corrugation. In: 2004 International Symposium on Antennas and Propagation, vol. 1, pp. 69–72 (2004)
Google Scholar
Tsuboi, K., Masuya, G.: Direct Numerical Simulations for Instabilities of Remixed Planar Flames. In: The Fourth Asia-Pacific Conference on Combustion, Nanjing, China (November 2003)
Google Scholar
Nakajima, M., et al.: Numerical Simulation of Three-Dimensional Separated Flow and Heat Transfer around Staggerd Surface-Mounted Rectangular Blocks in a Channel. Numerical Heat Transfer, Part A 47, 691–708 (2005)
Article Google Scholar
Ariyoshi, K., et al.: Spatial variation in propagation speed of postseismic slip on the subducting plate boundary. In: 2nd Water Dynamics, vol. B-30, Sendai, Japan (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Tohoku University, Sendai, 980-6025, Japan
Akihiro Musa, Hiroyuki Takizawa, Koki Okabe & Hiroaki Kobayashi
NEC Corporation, Tokyo, 108-8001, Japan
Akihiro Musa
NEC System Tecnologies, Osaka, 540-8551, Japan
Takashi Soga

Authors

Akihiro Musa
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Takizawa
View author publications
You can also search for this author in PubMed Google Scholar
Koki Okabe
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Soga
View author publications
You can also search for this author in PubMed Google Scholar
Hiroaki Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Shanghai Jiao Tong University, 200030, Shanghai, China
Minyi Guo
Department of Computer Science, St. Francis Xavier University, Antigonish, Canada
Laurence T. Yang
Dipartimento di Ingegneria dell’ Informazione - Second, University of Naples - Italy, Real Casa dell’Annunziata, via Roma, 29 81031, Aversa (CE), Italy
Beniamino Di Martino
Institute of Scientific Computing, University of Vienna, Nordbergstr. 15/C/3, A-1090, Vienna, Austria/JPL, Caltech, USA
Hans P. Zima
Computer Science Department, University of Tennessee, TN 37996-3450, Knoxville, USA
Jack Dongarra
Grid Computing Center, Shanghai Jiao Tong University, 800 Dongchuan Road, 200240, Shanghai, China
Feilong Tang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Musa, A., Takizawa, H., Okabe, K., Soga, T., Kobayashi, H. (2006). Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications. In: Guo, M., Yang, L.T., Di Martino, B., Zima, H.P., Dongarra, J., Tang, F. (eds) Parallel and Distributed Processing and Applications. ISPA 2006. Lecture Notes in Computer Science, vol 4330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11946441_76

Download citation

DOI: https://doi.org/10.1007/11946441_76
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68067-3
Online ISBN: 978-3-540-68070-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE

SX-ACE, Brand-New Vector Supercomputer for Higher Sustained Performance I

The Brand-New Vector Supercomputer, SX-ACE

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Implications of Memory Performance for Highly Efficient Supercomputing of Scientific Applications

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Potential of a modern vector supercomputer for practical applications: performance evaluation of SX-ACE

SX-ACE, Brand-New Vector Supercomputer for Higher Sustained Performance I

The Brand-New Vector Supercomputer, SX-ACE

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation