Fast computer simulation is an essential tool in the design of large parallel computers. Our Fast Accurate Simulation Tool, FAST, is able to accurately simulate large shared memory multiprocessors and their execution of parallel applications at simulation speeds that are one to two orders of magnitude faster than previous comparable simulators. The key ideas involve execution driven simulation techniques that modify the object code of the application program being studied. This produces an augmented version of the code that is directly executed and performs much of the work of the simulation. We extend the previous work by introducing several new uses of code augmentation. In this paper we summarize the tradeoffs made in the designs of this and previous simulators. In previous simulators, these tradeoffs have often led to sacrificing accuracy for faster simulation. However by careful selection of techniques and when to apply them, we have built a simulator that is both faster and more accurate than previous simulation systems. The improved accuracy comes from applying code augmentation techniques at a uniform low level and from having such fast context switching that accuracy/performance tradeoffs become unnecessary. Our simulator has a modular design and has been configured in many ways. It has been used to conduct numerous experiments on multithreaded machine behavior, application behavior, cache behavior, compiler optimization, and traffic patterns. Because of its high performance, we have been able to perform simulations of larger machines than would otherwise have been feasible.
Cited By
- Falsafi B and Wood D (2018). Modeling cost/performance of a parallel computer simulator, ACM Transactions on Modeling and Computer Simulation (TOMACS), 7:1, (104-130), Online publication date: 1-Jan-1997.
- Pimentel A and Hertzberger L An Architecture Workbench for Multicomputers Proceedings of the 11th International Symposium on Parallel Processing, (94-99)
- Park D and Saavedra R Trojan Proceedings of the 29th Annual Simulation Symposium (SS '96)
- Falsafi B and Wood D Cost/performance of a parallel computer simulator Proceedings of the eighth workshop on Parallel and distributed simulation, (173-182)
- Falsafi B and Wood D (2019). Cost/performance of a parallel computer simulator, ACM SIGSIM Simulation Digest, 24:1, (173-182), Online publication date: 1-Jul-1994.
- Reinhardt S, Hill M, Larus J, Lebeck A, Lewis J and Wood D The Wisconsin Wind Tunnel Proceedings of the 1993 ACM SIGMETRICS conference on Measurement and modeling of computer systems, (48-60)
- Reinhardt S, Hill M, Larus J, Lebeck A, Lewis J and Wood D (1993). The Wisconsin Wind Tunnel, ACM SIGMETRICS Performance Evaluation Review, 21:1, (48-60), Online publication date: 1-Jun-1993.
Recommendations
Conservative circuit simulation on shared-memory multiprocessors
We investigate conservative parallel discrete event simulations for logical circuits on shared-memory multiprocessors. For a first estimation of the possible speedup, we extend the critical path analysis technique by partitioning strategies. To ...
Conservative circuit simulation on shared-memory multiprocessors
PADS '96: Proceedings of the tenth workshop on Parallel and distributed simulationWe investigate conservative parallel discrete event simulations for logical circuits on shared-memory multiprocessors. For a first estimation of the possible speedup, we extend the critical path analysis technique by partitioning strategies. To ...
Fast and accurate DRAM simulation: can we further accelerate it?
DATE '20: Proceedings of the 23rd Conference on Design, Automation and Test in EuropeThe simulation of Dynamic Random Access Memories (DRAMs) in a system context requires highly accurate models due to the complex timing and power behavior of DRAMs. However, cycle accurate DRAM models often become the bottleneck regarding the overall ...