Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/29903.29912acmconferencesArticle/Chapter ViewAbstractPublication PagesmetricsConference Proceedingsconference-collections
Article
Free access

Memory access patterns of parallel scientific programs

Published: 01 May 1987 Publication History

Abstract

A parallel simulator, PSIMUL, has been used to collect information on the memory access patterns and synchronization overheads of several scientific applications. The parallel simulation method we use is very efficient and it allows us to simulate execution of an entire application program, amounting to hundreds of millions of instructions. We present our measurements on the memory access characteristics of these applications; particularly our observations on shared and private data, their frequency of access and locality. We have found that, even though the shared data comprise the largest portion of the data in the application program, on the average a small fraction of the memory references are to shared data. The low averages do not preclude bursts of traffic to shared memory nor does it rule out positive benefits from caching shared data. We also discuss issues of synchronization overheads and their effect on performance.

References

[1]
F. Darema-Rogers, D.A. George, A. Norton, G.F. Pfister "VM/EPEX - A VM Based Environment for Parallel Execution", IBM Research RCl1381, 1985.
[2]
F. Darema-Rogers, A. Norton, G.F. Pfister "Using a Single-Program- Multiple-Data Computational Model for Parallel Execution of Scientific Applications", IBM Research RC11552, 1985.
[3]
A. Gottlieb "WASHCLOTH- The Logical Successor to SOAPSUDS", Ultracomputer Note #12, Courant Institute, NYU, Dec. 1980.
[4]
T.Y. Kwok and F. Darema-Rogers (unpublished program).
[5]
V.A. Norton and G.F. Pfister, "A Methodology for Predicting Multiprocessor Performance", IEEE, Proceedings, ICPP 1985 (Aug. 1985)pp772-781.
[6]
V.A. Norton and A. Silberger, "Parallelization and Performance Prediction of the Cooley-Tuckey FFT Algorithm for Shared-Memory Architectures", IBM Research RC11237, 1985.
[7]
A. Liles and G. Paul, SEMUL, developed at IBM Research to trace execution of scientific programs.
[8]
G.F. Pfister and V.A. Norton "Hot Spot Contention and Combining in Multistage Interconnection Networks" IEEE Trans. on Computers, vol. c-34, no. 10, (Oct 1985) pp943-948.
[9]
G.F. Pfister, W.C. Brantley, D.A. George, S.L. Harvey, W.J. Kleinfelder, K.P. McAuliffe, E.A. Melton, A. Norton, and j. Weiss, "The IBM Research Parallel Processor Prototype (RP3) : Introduction and Architecture", IEEE, Proceedings, ICPP 1985 (Aug. 1985) pp764-771.
[10]
T.H. Pulliam, and J. L. Steger, "Implicit Finite-Difference Simulations of Three- Dimensional Compresible Flow", AIAA Journal, vol 18, Feb. 1980, p 159.
[11]
K. So, F. Darema-Rogers, D. George, V. A. Norton and G. F. Pfister, "PSIMUL - A System For Parallel Simulation of the Execution of Parallel Programs", IBM Research Report RC11674, 1986.

Cited By

View all
  • (2021)SOFTWARE‐DEFINED ENVIRONMENTSData Center Handbook10.1002/9781119597537.ch8(143-153)Online publication date: 23-Apr-2021
  • (2014)Trace-Driven Memory Access Pattern Recognition in Computational KernelsProceedings of the Second Workshop on Optimizing Stencil Computations10.1145/2686745.2686748(25-32)Online publication date: 20-Oct-2014
  • (2014)Exploiting Emergence in On-Chip InterconnectsIEEE Transactions on Computers10.1109/TC.2012.27363:3(570-582)Online publication date: 1-Mar-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMETRICS '87: Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems
August 1987
267 pages
ISBN:089791225X
DOI:10.1145/29903
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1987

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

SIGMETRICS87
Sponsor:

Acceptance Rates

Overall Acceptance Rate 459 of 2,691 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)55
  • Downloads (Last 6 weeks)7
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)SOFTWARE‐DEFINED ENVIRONMENTSData Center Handbook10.1002/9781119597537.ch8(143-153)Online publication date: 23-Apr-2021
  • (2014)Trace-Driven Memory Access Pattern Recognition in Computational KernelsProceedings of the Second Workshop on Optimizing Stencil Computations10.1145/2686745.2686748(25-32)Online publication date: 20-Oct-2014
  • (2014)Exploiting Emergence in On-Chip InterconnectsIEEE Transactions on Computers10.1109/TC.2012.27363:3(570-582)Online publication date: 1-Mar-2014
  • (2013)MVA-Based Probabilistic Model of Shared Memory with a Round Robin Arbiter for Predicting Performance with Heterogeneous WorkloadProceedings of the International Conference on Multicore Software Engineering, Performance, and Tools - Volume 806310.1007/978-3-642-39955-8_2(13-24)Online publication date: 19-Aug-2013
  • (2007)Analysis of input-dependent program behavior using active profilingProceedings of the 2007 workshop on Experimental computer science10.1145/1281700.1281705(5-es)Online publication date: 13-Jun-2007
  • (2007)Characteristics of workloads used in high performance and technical computingProceedings of the 21st annual international conference on Supercomputing10.1145/1274971.1274984(73-82)Online publication date: 17-Jun-2007
  • (2006)Program phase detection and exploitationProceedings of the 20th international conference on Parallel and distributed processing10.5555/1898699.1898796(279-279)Online publication date: 25-Apr-2006
  • (2006)Program phase detection and exploitationProceedings 20th IEEE International Parallel & Distributed Processing Symposium10.1109/IPDPS.2006.1639570(8 pp.)Online publication date: 2006
  • (2005)Quantifying Locality In The Memory Access Patterns of HPC ApplicationsProceedings of the 2005 ACM/IEEE conference on Supercomputing10.1109/SC.2005.59Online publication date: 12-Nov-2005
  • (2005)Parallel application characterization for multiprocessor scheduling policy designJob Scheduling Strategies for Parallel Processing10.1007/BFb0022294(175-199)Online publication date: 15-Jun-2005
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media