Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/182478.182587acmconferencesArticle/Chapter ViewAbstractPublication PagespadsConference Proceedingsconference-collections
Article
Free access

Cost/performance of a parallel computer simulator

Published: 01 July 1994 Publication History

Abstract

This paper examines the cost/performance of simulating a hypothetical target parallel computer using a commercial host parallel computer. We address the question of whether parallel simulation is simply faster than sequential simulation, or if it is also more cost-effective. To answer this, we develop a performance model of the Wisconsin Wind Tunnel (WWT), a system that simulates cache-coherent shared-memory machines on a message-passing Thinking Machines CM-5. The performance model uses Kruskal and Weiss's fork-join model to account for the effect of event processing time variability on WWT's conservative fixed-window simulation algorithm. A generalization of Thiebaut and Stone's footprint model accurately predicts the effect of cache interference on the CM-5. The model is calibrated using parameters extracted from a fully-parallel simulation (p=N), and validated by measuring the speedup as the number of processors (p) ranges from one to the number of target nodes (N. Together with simple cost models, the performance model indicates that for target system sizes of 32 nodes and larger, parallel simulation is more cost-effective than sequential simulation. The key intuition behind this result is that large simulations require large memories, which dominate the cost of a uniprocessor; parallel computers allow multiple processors to simultaneously access this large memory.

References

[1]
Rassul Ayani. A Parallel Simulation Scheme Based on the Distance Between Objects. In Proceedings of the SCS Mult~conference on Distmbuted Simulation, pages 113-118, March 1989.
[2]
David Bailey, John Barton, Thomas Lasinski, and Horst Simon. The NAS Parallel Benchmarks. Technical Report RNR-91-002 Revision 2, Ames Research Center, August 1991.
[3]
Bob Boothe. Fast Accurate Simulation of Large Shared Memory Multiprocessors. Technical Report CSD 92/682, Computer Science Division (EECS), University of California at Berkeley, January t992.
[4]
Eric A. Brewer, Chrysanthos N Dellarocas, Adrian CoIbrook, and William Weihl. PROTEUS: A High-Performance Parallel- Architecture Simulator. Technical Report MIT/LCS/TR-516, MIT Laboratory for Computer Science, September 1991.
[5]
Robert F. CmeIik and David Keppel. Shade: A Fast Instruction- Set Simulator for Execution Profiling. Technical Report UWCSE 93-06-06, Department of Computer Science, University of Washington, 1993.
[6]
R.C. Covington, S. Madala, V. Mehta, J.R. Jump, and J.B. Sinclair. The Rice Parallel Processing Testbed. In Proceedings of the 1988 A CM SIGMETRICS Conference on Measurement and Modeling of Con~pu~er Sy~tcrr~, pages 4-11, May 1988.
[7]
Helen Davis, Stephen R. Goldschmidt, and John Hennessy. Multiprocessor Simulation and Tracing Using Tango. In Proceedings of the 1991 International Conference on Parallel Processing (VoI. II Software), pages II99-107, August 1991.
[8]
Richard M. Fujimoto. Parallel Discrete Event Simulation. Communications of the ACM, 33(10):30-53, October 1990.
[9]
Mark D. Hill, James R. Larus, Steven K. Reinhardt, and David A. Wood. Cooperative Shared Memory: Software and Hardware for Scalable Multiprocessors. A CM Transactions on Computer Systems, 11(4):300-318, November 1993. Ealier version appeared in ASPLOS V, Oct. 1992.
[10]
Raj Jain. The Art of Computer Systems Performance Analysts: Technzques for Expemmental Design, Measurement, Simulation, and Modeling. John Wiley & Sons, 1991.
[11]
John L. Hennessy Jaswinder P. Singh and Anoop Gupta. Scaling Parallel Programs for Multiprocessors: Methodology and Examples. IEEE Computer, 26(7):42-50, July 1993.
[12]
C. P. Kruskal and A. Weiss. Allocating Independent Subtasks on Parallel Processors. IEEE Trans. on Software Engzneemng, 11(10):1001-1016, October 1985.
[13]
Charles E. Leiserson, Zahi S. Abuhamdeh, David C. Douglas, Carl g. Feynman, Mahesh N. Ganmukhi, Jeffrey V. Hill, W. Daniel Hillis, Bradley C. KuszmauI, Margaret A. St. Pierre, David S. Wells, Monica C. Wong, Shaw-Wen Yang, and Robert Zak. The Network Architecture of the Connection Machine CM- 5. In Proceedings of the Fifth A CM Symposium on Parallel Algorithms and Architectures (SPAA), July 1992.
[14]
Kai Li and Paul Hudak. Memory Coherence in Shared Virtual Memory Systems. A CM Transactions on Computer Systems, 7(4):321-359, November 1989.
[15]
Jia-Jen Lin. Eflfic~ent Parallel Simulation for Designing Multlprocessor System. PhD thesis, University of Michigan, Ann Arbor, 1992.
[16]
Boris D. Lubachevsky. Efficient Distributed Event-Driven Simulations of Multiple-Loop Networks. Communlcatwns of the ACM, 32(2):111-123, January 1989.
[17]
Jayadev Misra. Distributed-Discrete Event Simulation ACM Computing Surveys, 18(1):39-65, March 1986.
[18]
David Nicot. Conservative Parallel Simulation of Priority Class Queueing Networks. IEEE Transactions on Parallel and D~stributed Systems, 3(3):398-412, May 1992.
[19]
Ed Reidenbach. CHALLENGE Server Perdzod~c Table. Silicon Graphics Computer Systems, October 1993.
[20]
Steven K. Reinhardt, Mark D. Hill, James R. Larus, Alvin R. Lebeck, James C. Lewis, and David A. Wood. The Wisconsin Wind Tunnel: Virtual Prototyping of Parallel Computers. In Proceedings of the 1993 A CM Szgmetmcs Conference on Measurement and Modeling of Computer Systems, pages 48- 60, May 1993.
[21]
Jaswinder Pal Singh, Truman Joe, Anoop Gupta, and John L. Hennessy. An Empirical Comparison of the Kendall Square Research KSR-1 and Stanford DASH Multiprocessor. In Proceed-,ngs of Supercomputzng 93, pages 214-225, November 1993.
[22]
Jaswinder Pal Singh, Wolf-Dietrich Weber, and Anoop Gupta. SPLASH: Stanford Parallel Applications for Shared Memory. Computer Architecture News, 20(1):5-44, March 1992.
[23]
SPEC. SPEC Benchmark Suite Release 1.0, Winter 1990.
[24]
Jeff. S. Steinman. SPEEDES: A Multiple-Synchronization Environment for Parallel Discrete-Event Simulation. International Journal zn Computer Simulation, 2:251-286, 1992.
[25]
D. Thiebaut and H.S. Stone Footprints m the cache. A CM Transactions on Computer Systems, 5(4):305-329, November 1987.
[26]
David A. Wood, Satish Chandra, Babak Falsafi, Mark D. Hill, James R. Larus, Alvin R. Lebeck, James C. Lewis, Shubhendu S. Mukherjee, Subbarao Palacharla, and Steven K. Reinhardt. Mechanisms for Cooperative Shared Memory. In Proceedings of the 20th Annual International Symposium on Computer Architecture, pages 156-168, May 1993.

Cited By

View all
  • (2024)The Optimal Quantum of Temporal Decoupling2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)10.1109/ASP-DAC58780.2024.10473967(686-691)Online publication date: 22-Jan-2024
  • (1999)Profit-Effective Parallel ComputingIEEE Concurrency10.1109/4434.7669677:2(65-69)Online publication date: 1-Apr-1999
  • (1996)A model for parallel simulation of distributed shared memoryProceedings of the 4th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems10.5555/823448.823570Online publication date: 1-Feb-1996
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
PADS '94: Proceedings of the eighth workshop on Parallel and distributed simulation
August 1994
196 pages
ISBN:1565550277
DOI:10.1145/182478

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 July 1994

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

8PADS94
Sponsor:
8PADS94: ACM/IEEE 8th Workshop on Parallel and Distributed
July 6 - 8, 1994
Edinburgh, Scotland, United Kingdom

Acceptance Rates

PADS '94 Paper Acceptance Rate 27 of 66 submissions, 41%;
Overall Acceptance Rate 398 of 779 submissions, 51%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)70
  • Downloads (Last 6 weeks)15
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The Optimal Quantum of Temporal Decoupling2024 29th Asia and South Pacific Design Automation Conference (ASP-DAC)10.1109/ASP-DAC58780.2024.10473967(686-691)Online publication date: 22-Jan-2024
  • (1999)Profit-Effective Parallel ComputingIEEE Concurrency10.1109/4434.7669677:2(65-69)Online publication date: 1-Apr-1999
  • (1996)A model for parallel simulation of distributed shared memoryProceedings of the 4th International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems10.5555/823448.823570Online publication date: 1-Feb-1996
  • (1996)A performance analysis model for distributed simulationsProceedings of the 28th conference on Winter simulation10.1145/256562.256771(629-636)Online publication date: 8-Nov-1996
  • (1996)Optimistic simulation of parallel architectures using program executablesACM SIGSIM Simulation Digest10.1145/238793.23883826:1(143-150)Online publication date: 1-Jul-1996
  • (1996)Optimistic simulation of parallel architectures using program executablesProceedings of the tenth workshop on Parallel and distributed simulation10.1145/238788.238838(143-150)Online publication date: 1-Jul-1996
  • (1996)Optimistic Simulation of Parallel Architectures Using Program ExecutablesProceedings of Symposium on Parallel and Distributed Tools10.1109/PADS.1996.761572(143-150)Online publication date: 1996
  • (1996)A model for parallel simulation of distributed shared memoryProceedings of MASCOTS '96 - 4th International Workshop on Modeling, Analysis and Simulation of Computer and Telecommunication Systems10.1109/MASCOT.1996.501014(179-184)Online publication date: 1996
  • (1995)Accuracy vs. performance in parallel simulation of interconnection networksProceedings of 9th International Parallel Processing Symposium10.1109/IPPS.1995.395909(22-31)Online publication date: 1995
  • (1995)Cost-Effective Parallel ComputingComputer10.1109/2.34800228:2(69-72)Online publication date: 1-Feb-1995

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media