Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1066677.1067026acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

Space-efficient page-level incremental checkpointing

Published: 13 March 2005 Publication History

Abstract

Incremental checkpointing, which is intended to minimize checkpointing overhead, saves only the modified pages of a process. However, the cumulative size of incremental checkpoints increases at a steady rate over time because many updated values may be saved for the same page. In this paper, we present a comprehensive overview of Pickpt, which is a page-level incremental checkpointing facility. Pickpt provides space-efficient techniques for minimizing the use of disk space. For our experiments, the results show that the use of disk space of Pickpt was significantly reduced compared with existing incremental checkpointing.

References

[1]
M. Beck, J. S. Plank, and G. Kingsley. Compiler-assisted checkpointing. Technical Report UT-CS-94-269, University of Tennessee, 1994.]]
[2]
J. Heo, S. Yi, J. Hong, Y. Cho, and J. Choi. An efficient merging algorithm for recovery and garbage collection in incremental checkpointing. In IASTED International Conference on Parallel and, Distributed and Networks, pages 365--368, February 2004.]]
[3]
J. Hong, S. Kim, and Y. Cho. Cost analysis of optimistic recovery model for forked checkpointing. IEICE Transactions on Information and Systems, E86-D(9):1534--1541, September 2003.]]
[4]
J. Hong, T. Park, H. Yeom, and Y. Cho. Kckpt: An efficient checkpoint facility on unixware. In International Conference on Computers and Their Applications, pages 303--308, March 2000.]]
[5]
J. Lawall and G. Muller. Efficient incremental checkpointing of java programs. In IEEE Proceedings of the International Conference on Dependable Systems and Networks, pages 61--70, June 2000.]]
[6]
M. Litzkow, T. Tannenbaun, J. Basney, and M. Livny. Checkpoint and migration of unix processes in the condor distributed processing system. Technical Report 1346, Department of Computer Science, Univ. of Wisconsin-Madison, 1997.]]
[7]
J. Plank, M. Beck, and G. Kingsley. Compiler-assisted memory exclusion for fast checkpointing. In IEEE Technical Committee on Operating Systems and Application Environments, Special Issue on Fault-Tolerance, pages 62--67, December 1995.]]
[8]
J. Plank, M. Beck, G. Kingsley, and K. Li. Libckpt:transparent checkpointing under unix. In Usenix Winter Technical Conference, pages 213--223, January 1995.]]
[9]
J. Plank, Y. Chen, M. B. K. Li, and G. Kingsley. Memory exclusion: optimizing the performance of checkpointing systems. Software Practice and Experience, 29(2): 125--142, February 1999.]]
[10]
J. Plank, K. Li, and M. Puening. Diskless checkpointing. IEEE Transactions on Parallel and Distributed Systems, 9(10):303--308, October 1998.]]
[11]
J. Plank, J. Xu, and R. Netzer. Compressed differences: An algorithm for fast incremental checkpointing. Technical Report CS-95-302, University of Tennessee, August 1995.]]
[12]
A. Ziv and J. Bruck. An on-line algorithm for checkpoint placement. IEEE Transactions on Computers, 46(9):976--985, September 1997.]]

Cited By

View all
  • (2020)Checkpointing OpenSHMEM Programs Using Compiler Analysis2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS)10.1109/FTXS51974.2020.00011(51-60)Online publication date: Nov-2020
  • (2019)GPU snapshotProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3330361(171-183)Online publication date: 26-Jun-2019
  • (2019)CAPE: A Checkpointing-Based Solution for OpenMP on Distributed-Memory ArchitecturesParallel Computing Technologies10.1007/978-3-030-25636-4_8(93-106)Online publication date: 17-Jul-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '05: Proceedings of the 2005 ACM symposium on Applied computing
March 2005
1814 pages
ISBN:1581139640
DOI:10.1145/1066677
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 March 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. checkpoint and recovery
  2. fault tolerance
  3. linux kernel
  4. page-level incremental checkpointing

Qualifiers

  • Article

Conference

SAC05
Sponsor:
SAC05: The 2005 ACM Symposium on Applied Computing
March 13 - 17, 2005
New Mexico, Santa Fe

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2020)Checkpointing OpenSHMEM Programs Using Compiler Analysis2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS)10.1109/FTXS51974.2020.00011(51-60)Online publication date: Nov-2020
  • (2019)GPU snapshotProceedings of the ACM International Conference on Supercomputing10.1145/3330345.3330361(171-183)Online publication date: 26-Jun-2019
  • (2019)CAPE: A Checkpointing-Based Solution for OpenMP on Distributed-Memory ArchitecturesParallel Computing Technologies10.1007/978-3-030-25636-4_8(93-106)Online publication date: 17-Jul-2019
  • (2016)Peeking into the Past: Efficient Checkpoint-Assisted Time-Traveling Debugging2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE)10.1109/ISSRE.2016.9(455-466)Online publication date: Oct-2016
  • (2015)Speculative Memory CheckpointingProceedings of the 16th Annual Middleware Conference10.1145/2814576.2814802(197-209)Online publication date: 24-Nov-2015
  • (2015)Efficiently Restoring Virtual MachinesInternational Journal of Parallel Programming10.1007/s10766-013-0295-043:3(421-439)Online publication date: 1-Jun-2015
  • (2012)VPCProceedings of the 2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing10.1109/SBAC-PAD.2012.31(203-210)Online publication date: 24-Oct-2012
  • (2012)Assessing HPC Failure Detectors for MPI JobsProceedings of the 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing10.1109/PDP.2012.11(81-88)Online publication date: 15-Feb-2012
  • (2010)Hybrid Checkpointing for MPI Jobs in HPC EnvironmentsProceedings of the 2010 IEEE 16th International Conference on Parallel and Distributed Systems10.1109/ICPADS.2010.48(524-533)Online publication date: 8-Dec-2010
  • (2010)A New Concurrent Checkpoint Mechanism for Real-Time and Interactive ProcessesProceedings of the 2010 IEEE 34th Annual Computer Software and Applications Conference10.1109/COMPSAC.2010.12(47-52)Online publication date: 19-Jul-2010
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media