Cooperative Write-Behind Data Buffering for MPI I/O

Wei-keng Liao¹⁹,
Kenin Coloma¹⁹,
Alok Choudhary¹⁹ &
…
Lee Ward²⁰

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 3666))

Included in the following conference series:

European Parallel Virtual Machine / Message Passing Interface Users’ Group Meeting

828 Accesses
6 Citations

Abstract

Many large-scale production parallel programs often run for a very long time and require data checkpoint periodically to save the state of the computation for program restart and/or tracing the progress. Such a write-only pattern has become a dominant part of an application’s I/O workload and implies the importance of its optimization. Existing approaches for write-behind data buffering at both file system and MPI I/O levels have been proposed, but challenges still exist for efficient design to maintain data consistency among distributed buffers. To address this problem, we propose a buffering scheme that coordinates the compute processes to achieve the consistency control. Different from other earlier work, our design can be applied to files opened in read-write mode and handle the patterns with mixed MPI collective and independent I/O calls. Performance evaluation using BTIO and FLASH IO benchmarks is presented, which shows a significant improvement over the method without buffering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A High-Performance Collective I/O Framework Leveraging Node-Local Persistent Memory

An MPI-IO In-Memory Driver for Non-volatile Pooled Memory of the Kove XPD

Hyper Burst Buffer: A Lightweight Burst Buffer I/O Library for High Performance Computing Applications

References

Callaghan, B.: NFS Illustrated. Addison-Wesley, Reading (2000)
Google Scholar
Ma, X., Winslett, M., Lee, J., Yu, S.: Improving MPI-IO Output Performance with Active Buffering Plus Threads. In: The International Parallel and Distributed Processing Symposium, IPDPS (2003)
Google Scholar
Thakur, R., Gropp, W., Lusk, E.: Users Guide for ROMIO: A High-Performance, Portable MPI-IO Implementation. Technical Report ANL/MCS-TM-234, Mathematics and Computer Science Division, Argonne National Laboratory (1997)
Google Scholar
Message Passing Interface Forum: MPI-2: Extensions to the Message Passing Interface (1997), http://www.mpi-forum.org/docs/docs.html
Purakayastha, A., Ellis, C.S., Kotz, D.: ENWRICH: A Compute-Processor Write Caching Scheme for Parallel File Systems. In: The Fourth Workshop on Input/Output in Parallel and Distributed Systems, IOPADS (1996)
Google Scholar
Prost, J., Treumann, R., Hedges, R., Jia, B., Koniges, A.: MPI-IO/GPFS, an Optimized Implementation of MPI-IO on top of GPFS. In: Supercomputing (2001)
Google Scholar
Schmuck, F., Haskin, R.: GPFS: A Shared-Disk File System for Large Computing Clusters. In: The Conference on File and Storage Technologies (FAST 2002), pp. 231–244 (2002)
Google Scholar
Bernstein, P., Hadzilacos, V., Goodman, N.: Concurrency Control and Recovery in Database Systems. Addison-Wesley, Reading (1987)
Google Scholar
IEEE/ANSI Std. 1003.1: Portable Operating System Interface (POSIX)-Part 1: System Application Program Interface (API) [C Language]. (1996)
Google Scholar
Thakur, R., Gropp, W., Lusk, E.: On Implementing MPI-IO Portably and with High Performance. In: The Sixth Workshop on I/O in Parallel and Distributed Systems, pp. 23–32 (1999)
Google Scholar
Thakur, R., Gropp, W., Lusk, E.: An Abstract-Device Interface for Implementing Portable Parallel-I/O Interfaces. In: The 6th Symposium on the Frontiers of Massively Parallel Computation (1996)
Google Scholar
Thakur, R., Gropp, W., Lusk, E.: Data Sieving and Collective I/O in ROMIO. In: The 7th Symposium on the Frontiers of Massively Parallel Computation (1999)
Google Scholar
Wong, P., der Wijngaart, R.: NAS Parallel Benchmarks I/O Version 2.4. Technical Report NAS-03-002, NASA Ames Research Center, Moffet Field, CA (2003)
Google Scholar
Fineberg, S., Wong, P., Nitzberg, B., Kuszmaul, C.: PMPIO - A Portable Implementation of MPI-IO. In: The 6th Symposium on the Frontiers of Massively Parallel Computation (1996)
Google Scholar
Fryxell, B., Olson, K., Ricker, P., Timmes, F.X., Zingale, M., Lamb, D.Q., MacNeice, P., Rosner, R., Tufo, H.: FLASH: An Adaptive Mesh Hydrodynamics Code for Modelling Astrophysical Thermonuclear Flashes. Astrophysical Journal Suppliment, 131–273 (2000)
Google Scholar
Zingale, M.: FLASH I/O Benchmark Routine – Parallel HDF 5 (2001), http://flash.uchicago.edu/~zingale/flash_benchmark_io

Download references

Author information

Authors and Affiliations

Electrical and Computer Engineering Department, Northwestern University,
Wei-keng Liao, Kenin Coloma & Alok Choudhary
Scalable Computing Systems Department, Sandia National Laboratories,
Lee Ward

Authors

Wei-keng Liao
View author publications
You can also search for this author in PubMed Google Scholar
Kenin Coloma
View author publications
You can also search for this author in PubMed Google Scholar
Alok Choudhary
View author publications
You can also search for this author in PubMed Google Scholar
Lee Ward
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Ingegneria dell’ Informazione, Second University of Naples - Italy, Real Casa dell’Annunziata - via Roma, 29, 81031, Aversa, CE, Italy
Beniamino Di Martino
GUP, Institute of Graphics and Parallel Processing, Johannes Kepler University, Altenbergerstraße 69, A-4040, Linz, Austria
Dieter Kranzlmüller
Computer Science Department, University of Tennessee, 37996-3450, Knoxville, TN, USA
Jack Dongarra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liao, Wk., Coloma, K., Choudhary, A., Ward, L. (2005). Cooperative Write-Behind Data Buffering for MPI I/O. In: Di Martino, B., Kranzlmüller, D., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2005. Lecture Notes in Computer Science, vol 3666. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11557265_17

Download citation

DOI: https://doi.org/10.1007/11557265_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29009-4
Online ISBN: 978-3-540-31943-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Cooperative Write-Behind Data Buffering for MPI I/O

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A High-Performance Collective I/O Framework Leveraging Node-Local Persistent Memory

An MPI-IO In-Memory Driver for Non-volatile Pooled Memory of the Kove XPD

Hyper Burst Buffer: A Lightweight Burst Buffer I/O Library for High Performance Computing Applications

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Cooperative Write-Behind Data Buffering for MPI I/O

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

A High-Performance Collective I/O Framework Leveraging Node-Local Persistent Memory

An MPI-IO In-Memory Driver for Non-volatile Pooled Memory of the Kove XPD

Hyper Burst Buffer: A Lightweight Burst Buffer I/O Library for High Performance Computing Applications

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation