DOI: 10.1145/331532.331573

Architectural requirements and scalability of the NAS parallel benchmarks

Published: 01 January 1999

References

[1]
Agarwal, A., Horowitz, M., and Hennessy, J., "An Analytical Cache Model". ACM Trans. on Comp. Sys., Vol.7, no.2, May 1989, pp. 184-215.
[2]
T. E. Anderson, D. E. Culler, D. A. Patterson, and the NOW Team. A Case for NOW (Networks of Workstations). IEEE Micro, February 1995.
[3]
David H. Bailey, T. Harris, Rob Van der Wijngaart, William Saphir, Alex Woo, and Maurice Yarrow. The NAS Parallel Benchmarks 2.0. Technical Report NAS-95-010, NASA Ames Research Center, 1995.
[4]
Nanette J. Boden, Danny Cohen, Robert E. Felderman, Alan E. Kulawik, Charles L. Seitz, Jakov N. Seizovic, and Wen-King Su. Myrinet - A Gigabit-per-Second Local-Area Network. IEEE Micro, Volume 15, number 1, Feb. 1995, pp. 29-38.
[5]
Bob Cmelik and Doug Keppel. Shade: A Fast Instruction-set Simulator for Execution Profiling. In Proceedings of SIGMETRICS 94, pp. 128-137.
[6]
Leonardo Dagum, David H. Bailey, Eric Barszcz and Horst D. Simon. NAS Parallel Benchmarks Results. Technical Report RNR-93-016, NASA Ames Research Center, 1993.
[7]
J. Dongarra and T. Dunnigan. Message Passing Performance of Various Computers. University of Tennessee Technical Report CS-95-299, May 1995.
[8]
T. von Eicken, D. Culler, S. Goldstein, and K. Schauser. "Active Messages: a Mechanism for Integrated Communication and Computation". In Proceedings of the 19th International Symposium on Computer Architecture, May 1992, Gold Coast, Qld., Australia, pp. 256-266.
[9]
W. Gropp, E. Lusk, N. Doss, and A. Skjellum. A high-performance, portable implementation of the MPI message passing interface standard. Parallel Computing 22(6):789-828, September 1996.
[10]
Mark Hill. The Dinero Cache Simulator. Aug. 1995, http://www.cs.wisc.edu/~larus/warts.html
[11]
A. Mainwaring. Active Message Application Programming Interface and Communication Subsystem Organization. University of California at Berkeley, Computer Science Department, Technical Report UCB CSD-96-918, October 1996.
[12]
Richard P. Martin, Amin M. Vahdat, David E. Culler, and Thomas E. Anderson. The Effects of Latency, Overhead and Bandwidth in a Cluster Architecture. In Proceedings of the 24th International Symposium on Computer Architecture, June 1997.
[13]
Message Passing Interface Forum. The MPI Message Passing Interface Standard. Technical Report, University of Tennessee, Knoxville, April 1994.
[14]
NASA Ames Research Center. NPB 2 Detailed Results, 1997. http://science.nas.nasa.gov/Software/NPB/NPB2Results.
[15]
Steven K. Reinhardt and Mark D. Hill and James R. Larus and Alvin R. Lebeck and James C. Lewis and David A. Wood. The Wisconsin Wind Tunnel: Virtual Prototyping of Parallel Computers, In SIGMETRICS 93. May, 1993.
[16]
Edward Rothberg, Jaswinder Pal Singh, and Anoop Gupta. Working Sets, Cache Sizes and Node Granularity Issues for Large Scale Multiprocessors. In Proceedings of the 20th International Symposium on Computer Architecture, pages 14-25, May 1993.
[17]
Saavedra-Barrera, R.H., CPU Performance Evaluation and Execution Time Prediction Using Narrow Spectrum Benchmarking, Ph.D. Thesis, UC Berkeley, Technical Report No. UCB/CSD 92/684, February 1992.
[18]
William Saphir, Alex Woo, and Maurice Yarrow. NAS Parallel Benchmark 2.1 Results. Technical Report NAS-96-010, NASA Ames Research Center, 1996.
[19]
Elisabeth Wechsler. NAS Parallel Benchmarks Set The Industry Standard for MPP Performance. NAS News, Jan - Feb, Volume 2, Number 8, 1995. http://science.nas.nasa.gov/Pubs/NAnews/98/01/Benchmark.html
[20]
Steven Cameron Woo, Moriyoshi Ohara, Evan Torrie, Jaswinder Pal Singh, and Anoop Gupta. The SPLASH-2 Programs: Characterization and Methodological Considerations. In Proceedings of the 22nd International Symposium on Computer Architecture, pages 24-36, June 1995.
[21]
Maurice Yarrow and Rob Van der Wijngaart. Communication Improvement for the LU NAS Parallel Benchmark: A Model for Efficient Parallel Relaxation Schemes. Technical Report NAS-97-032, NASA Ames Research Center, November 1997.


Published In

SC '99: Proceedings of the 1999 ACM/IEEE conference on Supercomputing
January 1999
1015 pages
ISBN: 1581130910
DOI: 10.1145/331532
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Conference

SC '99
Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Cited By

  • (2024) Software Resource Disaggregation for HPC with Serverless Computing. 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 139-156. DOI: 10.1109/IPDPS57955.2024.00021. Online publication date: 27-May-2024
  • (2020) Understanding the use of message passing interface in exascale proxy applications. Concurrency and Computation: Practice and Experience, 33:14. DOI: 10.1002/cpe.5901. Online publication date: 17-Aug-2020
  • (2018) BookLeaf: An Unstructured Hydrodynamics Mini-Application. 2018 IEEE International Conference on Cluster Computing (CLUSTER), pp. 615-622. DOI: 10.1109/CLUSTER.2018.00078. Online publication date: Sep-2018
  • (2017) Main Memory in HPC. ACM Transactions on Architecture and Code Optimization, 14:1, pp. 1-26. DOI: 10.1145/3023362. Online publication date: 6-Mar-2017
  • (2017) Analyzing the impact of communication imbalance in high-speed networks. Concurrency and Computation: Practice and Experience, 30:7. DOI: 10.1002/cpe.4394. Online publication date: 20-Dec-2017
  • (2016) On the performance and energy efficiency of the PGAS programming model on multicore architectures. 2016 International Conference on High Performance Computing & Simulation (HPCS), pp. 800-807. DOI: 10.1109/HPCSim.2016.7568416. Online publication date: Jul-2016
  • (2016) Automatic communication optimization of parallel applications in public clouds. Proceedings of the 16th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, pp. 1-10. DOI: 10.1109/CCGrid.2016.59. Online publication date: 16-May-2016
  • (2016) System-Level Transparent Checkpointing for OpenSHMEM. OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, pp. 52-65. DOI: 10.1007/978-3-319-50995-2_4. Online publication date: 15-Dec-2016
  • (2016) A Comparative Study of Application Performance and Scalability on the Intel Knights Landing Processor. High Performance Computing, pp. 307-318. DOI: 10.1007/978-3-319-46079-6_22. Online publication date: 6-Oct-2016
  • (2015) Experimental Results of a Raspberry Pi Based WMN Testbed for Multiple Flows and Distributed Concurrent Processing. Proceedings of the 2015 10th International Conference on Broadband and Wireless Computing, Communication and Applications (BWCCA), pp. 201-206. DOI: 10.1109/BWCCA.2015.95. Online publication date: 4-Nov-2015
