
A new checkpoint mechanism for real time operating systems

Published: 01 October 1997

Abstract

This paper presents an overview of a proposed protocol that provides application-transparent fault-tolerant services in a real-time operating system. Fault tolerance is achieved by saving checkpoints of the processes that belong to a real-time application. The approach extends some real-time system calls so that a recovery point is saved whenever the user invokes them. The protocol allows the designer of a real-time application to know the temporal specification of every system call. Current real-time applications are composed of several real-time processes, which share data through the interprocess communication facilities provided by the operating system. The operating system must take these interactions into account to ensure that checkpoints are consistent. It does so by tracking the communications performed since the last checkpoint and forcing dependent processes to checkpoint at the same time.
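The consistency rule in the last two sentences of the abstract can be illustrated with a small sketch. The abstract does not give the protocol's data structures, so everything below is an assumption made for illustration: the class and method names are hypothetical, a communication is treated as a symmetric dependency between sender and receiver, and dependencies are followed transitively. The real protocol runs inside the operating system's system calls; this user-level Python model only shows the bookkeeping.

```python
from collections import defaultdict


class CheckpointCoordinator:
    """Hypothetical bookkeeping model; names are not taken from the paper."""

    def __init__(self):
        # For each process, the peers it has communicated with
        # since its last checkpoint.
        self.peers = defaultdict(set)
        # Number of checkpoints each process has taken so far.
        self.checkpoint_count = defaultdict(int)

    def record_communication(self, sender, receiver):
        # Assumption: any exchange makes both sides depend on each other.
        self.peers[sender].add(receiver)
        self.peers[receiver].add(sender)

    def dependent_group(self, pid):
        # Collect every process reachable from pid through communications
        # recorded since the last checkpoint (transitive closure).
        group, stack = {pid}, [pid]
        while stack:
            current = stack.pop()
            for peer in self.peers[current]:
                if peer not in group:
                    group.add(peer)
                    stack.append(peer)
        return group

    def checkpoint(self, pid):
        # Force the whole dependent group to checkpoint together,
        # then forget the communications that the checkpoint covers.
        group = self.dependent_group(pid)
        for member in group:
            self.checkpoint_count[member] += 1
            self.peers[member].clear()
        return group


coordinator = CheckpointCoordinator()
coordinator.record_communication("A", "B")
coordinator.record_communication("B", "C")
coordinator.record_communication("D", "E")
group = coordinator.checkpoint("A")  # A, B and C checkpoint together
```

In this model, processes D and E are untouched by the checkpoint of A because they have exchanged no data with A, B, or C. A second checkpoint of A immediately afterwards involves only A, since the recorded communications were cleared.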



Published In

ACM SIGOPS Operating Systems Review, Volume 31, Issue 4
Oct. 1997
91 pages
ISSN:0163-5980
DOI:10.1145/271019

Publisher

Association for Computing Machinery

New York, NY, United States
