Abstract
Exascale supercomputers will have the potential for billion-way parallelism. While physical implementations of these systems are currently not available, HPC system designers can develop models of exascale systems to evaluate system design points. Modeling these systems and associated subsystems is a significant challenge. In this paper, we present the Co-design of Exascale Storage System (CODES) framework for evaluating exascale storage system design points. As part of our early work with CODES, we discuss the use of the CODES framework to simulate leadership-scale storage systems in a tractable amount of time using parallel discrete-event simulation. We describe the current storage system models and protocols included with the CODES framework and demonstrate the use of CODES through simulations of an existing petascale storage system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Overview of the IBM Blue Gene/P project. IBM Journal of Research and Development, 52(1.2), 199–220 (January 2008)
IOR benchmark (October 2011)
Ang, J., Doerfler, D., Dosanjh, S., Koch, K., Morrison, J., Vigil, M.: The alliance for computing at the extreme scale. In: Proceedings of the Cray Users Group Meeting (2010)
Bauer, D.W., Carothers, C.D., Holder, A.: Scalable time warp on Blue Gene supercomputers. In: Proc. ACM/IEEE Workshop on Principles of Advanced and Distributed Simulation (PADS 2009), Lake Placid, NY (2009)
Carns, P.H., Ligon III, W.B., Ross, R.B., Thakur, R.: PVFS: A Parallel File System for Linux Clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, pp. 317–327 (2000)
Lang, S., Carns, P., Latham, R., Ross, R., Harms, K., Allcock, W.: I/O performance challenges at leadership scale. In: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, p. 40. ACM (2009)
Liu, N., Carothers, C.D.: Modeling billion-node torus networks using massively parallel discrete-event simulation. In: Proceedings of the Workshop on Principles of Advanced and Distributed Simulation (PADS), pp. 1–8. IEEE, France (2011)
Liu, Y., Figueiredo, R., Clavijo, D., Xu, Y., Zhao, M.: Towards simulation of parallel file system scheduling algorithms with PFSsim. In: Proceedings of the 7th IEEE International Workshop on Storage Network Architectures and Parallel I/O (May 2011)
Molina-Estolano, E., Maltzahn, C., Bent, J., Brandt, S.A.: Building a parallel file system simulator. Journal of Physics: Conference Series 180, 012050 (2009)
Moreira, J., Brutman, M., Castaños, J., Engelsiepen, T., Giampapa, M., Gooding, T., Haskin, R., Inglett, T., Lieber, D., McCarthy, P., Mundy, M., Parker, J., Wallenfelt, B.: Designing a highly-scalable operating system: the blue gene/l story. In: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, SC 2006. ACM, New York (2006)
Nguyen, H.Q.: File system simulation: Hierachical performance measurement and modeling. PhD thesis, University of Arkansas (2011)
Nguyen, H.Q., Apon, A.W.: Hierarchical performance measurement and modeling of the linux file system. In: ICPE, pp. 73–84 (2011)
Perumalla, K.S.: μπ: a scalable and transparent system for simulating MPI programs. In: Proceedings of the 3rd International ICST Conference on Simulation Tools and Techniques, SIMUTools 2010, pp. 62:1–62:6. ICST, Brussels (2010)
Rodrigues, A.F., Hemmert, K.S., Barrett, B.W., Kersey, C., Oldfield, R., Weston, M., Risen, R., Cook, J., Rosenfeld, P., CooperBalls, E., Jacob, B.: The structural simulation toolkit. SIGMETRICS Perform. Eval. Rev. 38, 37–42 (2011)
Schmuck, F., Haskin, R.: Gpfs: A shared-disk file system for large computing clusters. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies (2002)
Settlemyer, B.W.: A Study of Client-side Caching in Parallel File Systems. PhD thesis, Clemson University, Clemson, South Carolina, USA (2009)
Shipman, G., Dillow, D., Oral, S., Wang, F.: The spider center wide file system: From concept to reality. In: Proceedings, Cray User Group (CUG) Conference, Atlanta, GA (2009)
Yaun, G., Carothers, C.D., Kalyanaraman, S.: Large-scale TCP models using optimistic parallel simulation. In: Proceedings of the Seventeenth Workshop on Parallel and Distributed Simulation (PADS 2003), San Diego, CA (June 2003)
Yaun, G.R., Bauer, D.W., Bhutada, H.L., Carothers, C.D., Yuksel, M., Kalyanaraman, S.: Largescale network simulation techniques: Examples of TCP and OSPF models. SIGCOMM Computer Comunications Review Special Issue on Tools and Technologies for Research and Eduction 33(5), 27–41 (2004)
Zheng, G., Gupta, G., Bohm, E., Dooley, I., Kale, L.V.: Simulating Large Scale Parallel Applications using Statistical Models for Sequential Execution Blocks. In: Proceedings of the 16th International Conference on Parallel and Distributed Systems (ICPADS 2010), Shanghai, China, pp. 10–15 (December 2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liu, N. et al. (2012). Modeling a Leadership-Scale Storage System. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2011. Lecture Notes in Computer Science, vol 7203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31464-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-31464-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31463-6
Online ISBN: 978-3-642-31464-3
eBook Packages: Computer ScienceComputer Science (R0)