Architectural requirements and scalability of the NAS parallel benchmarks

Authors:

Frederick C. Wong,

Richard P. Martin,

Remzi H. Arpaci-Dusseau,

David E. Culler

SC '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing

Pages 41 - es

https://doi.org/10.1145/331532.331573

Published: 01 January 1999

Published In

SC '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing

January 1999

1015 pages

ISBN:1581130910

DOI:10.1145/331532

General Chair:
Cherri Pancake

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

SC '99

Sponsor:

SIGARCH
IEEE-CS

SC '99: International Conference for High Performance Computing, Networking, Storage and Analysis

November 14 - 19, 1999

Portland, Oregon, USA

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%
