New techniques for simulating high performance MPI applications on large storage networks

Alberto Núñez¹,
Javier Fernández¹,
Jose D. Garcia¹,
Félix Garcia¹ &
…
Jesús Carretero¹

188 Accesses
30 Citations
Explore all metrics

Abstract

In this work, we propose new techniques to analyze the behavior, the performance, and specially the scalability of High Performance Computing (in short, HPC) applications on different computing architectures. Our final objective is to test applications using a wide range of architectures (real or merely designed) and scaling it to any number of nodes or components. This paper presents a new simulation framework, called SIMCAN, for HPC architectures. The main characteristic of the proposed simulation framework is the ability to be configured for simulating a wide range of possible architectures that involve any number of components. SIMCAN is developed to simulate complete HPC architectures, but putting special emphasis on the storage and network subsystems. The SIMCAN framework can handle complete components (nodes, racks, switches, routers, etc.), but also key elements of the storage and network subsystems (disks, caches, sockets, file systems, schedulers, etc.). We also propose several methods to implement the behavior of HPC applications. Each method has its own advantages and drawbacks. In order to evaluate the possibilities and the accuracy of the SIMCAN framework, we have tested it by executing a HPC application called BIPS3D on a hardware-based computing cluster and on a modeled environment that represent the real cluster. We also checked the scalability of the application using this kind of architecture by simulating the same application with an increased number of computing nodes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enabling Multi-level Network Modeling in Structural Simulation Toolkit for Next-Generation HPC Network Design Space Exploration

A Generic Framework for Building Heterogeneous Simulations of Parallel and Distributed Computing Systems

Article 20 March 2017

StorAlloc: A Simulator for Job Scheduling on Heterogeneous Storage Resources

References

Varga A (2001) The OMNeT++ discrete event simulation system. In: Proceedings of the European simulation multiconference (ESM’2001), Prague, Czech Republic, 2001
Varga A (2007) The INET framework. http://ctieware.eng.monash.edu.au/twiki/bin/view/Simulation/INETFramework
Gropp W, Huss-Lederman S, Lumsdaine A, Lusk E, Netzberg B, Saphir W, Snir M (1998) MPI: the complete reference, vol 2—The MPI-2 extensions. MTI-Press
Berenbrink P, Brinkmann A, Scheideler C (2001) SIMLAB: a simulation environment for storage area networks. In: Proceedings of the 9th Euromicro workshop on parallel and distributed processing, Mantova, Italy, 2001, pp 227–234
Molero X, Silla F, Santonja V, Duato J (2000) Modeling and simulation of storage area networks. In: MASCOTS ’00: proceedings of the 8th international symposium on modeling, analysis and simulation of computer and telecommunication systems, Washington, DC, USA, 2000, pp 307–314
Bagrodia R, Meyer R, Takai M, Chen Y, Zeng X, Martin J, Song HY (1998) PARSEC: a parallel simulation environment for complex systems. Comput Mag 31(10):77–85
Google Scholar
Bajaj S, Breslau L, Estrin D, Fall K, Floyd S, Haldar P, Handley M, Helmy A, Heidemann J, Huang P, Kumar S, McCanne S, Rejaie R, Sharma P, Varadhan K, Xu Y, Yu H, Zappala D (1999) Improving simulation for network research. University of Southern California. Tech. Rep. 99-702b, March (1999). http://www.isi.edu/~johnh/PAPERS/Bajaj99a.html
Martin MMK, Sorin DJ, Beckmann BM, Marty MR, Xu M, Alameldeen AR, Moore KE, Hill MD, Wood DA (2005) Multifacet’s general execution-driven multiprocessor simulator (GEMS) toolset. ACM SIGARCH Comput Archit News 33(4):92–99
Article Google Scholar
Hardavellas N, Somogyi S, Wenisch TF, Wunderlich E, Chen S, Kim J, Falsafi B, Hoe JC, Nowatzyk AG (2004) Simflex: a fast, accurate, flexible full-system simulation framework for performance evaluation of server architecture. SIGMETRICS Perform Eval Rev 31:31–35
Article Google Scholar
Prakash S, Bagrodia RL (1998) MPI-SIM: using parallel simulation to evaluate MPI programs. In: Winter Simulation Conference Proceedings, Washington, DC, USA vol 1, 1998, pp 467–474
Bagrodia R, Deelman E, Phan T (2001) Parallel simulation of large-scale parallel applications. Int J High Perform Comput Appl 15(1):3–12
Article Google Scholar
Corbett PF, Feitelson DG (1996) The Vesta parallel file system. ACM Trans Comput Syst 14(3):225–264
Article Google Scholar
Riesen R (2006) A hybrid MPI simulator. In: 2006 IEEE international conference on cluster computing, 2006, pp 1–9
Khnemann M, Rauber T, Runger G (2004) A source code analyzer for performance prediction. In: 18th international parallel and distributed processing symposium (CDROM), 2004
Adve VS, Vernon MK (2004) Parallel program performance prediction using deterministic task graph analysis. ACM Trans Comput Syst (TOCS) 22(1):94–136
Article Google Scholar
Bagrodia R, Deeljman E, Docy S, Phan T (1999) Performance prediction of large parallel applications using parallel simulations. ACM SIGPLAN Not 34(8):151–162
Article Google Scholar
Adve V, Sakellariou R (2000) Application representations for multiparadigm performance modeling of large-scale parallel scientific codes. Int J High Perform Comput Appl 14(4):304–316
Article Google Scholar
Sundaram-Stukel D, Vernon MK (1999) Predictive analysis of a wavefront application using LogGP. ACM SIGPLAN Not 34(8):141–150
Article Google Scholar
Loureiro A, González J, Pena TF (2003) A parallel 3D semiconductor device simulator for gradual heterojunction bipolar transistors. J Numer Model: Electron Netw Devices Fields 16:53–66
Article MATH Google Scholar
Filgueira R, Singh DE, Isaila F, Carretero J, García Loureiro AJ (2007) Optimization and evaluation of parallel I/O in BIPS3D parallel irregular application. In: 21st IEEE international parallel and distributed processing symposium, Long Beach, USA, 2007, pp 1–8
Karypis G, Kumar V (1998) METIS. A software package for partitioning unstructured graphs, partitioning meshes and computing fill-reducing orderings of sparse matrices. Department of Computer Science/Army HPC Research Center, University of Minnesota, Minneapolis
Google Scholar
Gropp W, Lusk E (1998) Users’s guide for MPE: extensions for MPI programs. Argonne National Laboratory

Download references

Author information

Authors and Affiliations

Computer Architecture Group, Computer Science Department, Universidad Carlos III de Madrid, Leganés, Madrid, Spain
Alberto Núñez, Javier Fernández, Jose D. Garcia, Félix Garcia & Jesús Carretero

Authors

Alberto Núñez
View author publications
You can also search for this author in PubMed Google Scholar
Javier Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Jose D. Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Félix Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Jesús Carretero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alberto Núñez.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Núñez, A., Fernández, J., Garcia, J.D. et al. New techniques for simulating high performance MPI applications on large storage networks. J Supercomput 51, 40–57 (2010). https://doi.org/10.1007/s11227-009-0279-4

Download citation

Received: 28 November 2008
Accepted: 23 February 2009
Published: 17 March 2009
Issue Date: January 2010
DOI: https://doi.org/10.1007/s11227-009-0279-4

New techniques for simulating high performance MPI applications on large storage networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Enabling Multi-level Network Modeling in Structural Simulation Toolkit for Next-Generation HPC Network Design Space Exploration

A Generic Framework for Building Heterogeneous Simulations of Parallel and Distributed Computing Systems

StorAlloc: A Simulator for Job Scheduling on Heterogeneous Storage Resources

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

New techniques for simulating high performance MPI applications on large storage networks

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Enabling Multi-level Network Modeling in Structural Simulation Toolkit for Next-Generation HPC Network Design Space Exploration

A Generic Framework for Building Heterogeneous Simulations of Parallel and Distributed Computing Systems

StorAlloc: A Simulator for Job Scheduling on Heterogeneous Storage Resources

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now