Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1870926.1871101acmconferencesArticle/Chapter ViewAbstractPublication PagesdateConference Proceedingsconference-collections
research-article

HW/SW co-detection of transient and permanent faults with fast recovery in statically scheduled data paths

Published: 08 March 2010 Publication History

Abstract

This paper describes a hardware-/software-based technique to make the data path of a statically scheduled super scalar processor fault tolerant. The results of concurrently executed operations can be compared with little hardware overhead in order to detect a transient or permanent fault. Furthermore, the hardware extension allows to recover from a fault within one to two clock cycles and to distinguish between transient and permanent faults. If a permanent fault was detected, this fault is masked for the rest of the program execution such that no further time is needed for recovering from that fault. The proposed extensions were implemented in the data path of a simple VLIW processor in order to prove the feasibility and to determine the hardware overhead. Finally a reliability analysis is presented. It shows that for medium and large scaled data paths our extension provides an up to 98% better reliability than triple modular redundancy.

References

[1]
C. Bolchini and F. Salice: A Software Methodology for Detecting Hardware Faults in VLIW Data Paths. Proc. of the 2001 IEEE Int. Symp. on Defect and Fault Tolerance in VLSI Systems (DFT'01), pp. 170--175, 2001.
[2]
W. Chan and A. Orailoglu: High-Level Synthesis of Gracefully Degradable ASICs. Proc. of the European Design and Test Conference (ED&TC'96), pp. 50--54, 1996.
[3]
Y.-Y. Chen, H. Shi-Jinn and L. Hung-Chuan: An Integrated Fault-Tolerant Design Framework for VLIW Processors. 18th IEEE International Symposium on Defect and Fault Tolerance in VLSI Systems (DFT'03), pp. 555--562, 2003.
[4]
M. Franklin: A Study of Time Redundant Fault Tolerance Techniques for Superscalar Processors. International IEEE Workshop on Defect and Fault Tolerance in VLSI Systems (DFT'95), pp. 207--215, 1995.
[5]
O. Goloubeva, M. Rebaudengo, M. S. Reorda and M. Violante: Soft-error Detection Using Control Flow Assertions. Proc. of the 18th International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT'03), pp. 581--588, 2003.
[6]
J. G. Holm and P. Banerjee: Low Cost Concurrent Error Detection in a VLIW Architecture Using Replicated Instructions. Procedings of the International Conference on Parallel Processing (ICPP), pp. 192--195, 1992.
[7]
J. Hu, F. Li, V. Degalahal et. al.: Compiler-Assisted Soft Error Detection under Performance and Energy Constraints in Embedded Systems. ACM Transactions on Embedded Computing Systems, 8(4), pp. 27:1--27:30, 2009.
[8]
R. Karri, K. Kim and M. Potkonjak: Computer Aided Design of Fault-Tolerant Application Specific Programmable Processors. IEEE Transactions on Computers, 49(11), pp. 1272--1284, 2000.
[9]
I. Koren and C. M. Krishna: Fault-Tolerant Systems. Morgan Kaufmann, 2007.
[10]
A. Orailoglu: Microarchitectural Synthesis of Gracefully Degradable, Dynamically Reconfigurable ASICs. International Conference on Computer Design (ICCD'96), pp. 112--117, 1996.
[11]
M. Rebaudengo, M. S. Reorda, M. Torchiano and M. Violante: Soft-error Detection through Software Fault-Tolerance techniques. Proc. of the 14th International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT'99), pp. 210--218, 1999.

Cited By

View all
  • (2012)A new SBST algorithm for testing the register file of VLIW processorsProceedings of the Conference on Design, Automation and Test in Europe10.5555/2492708.2492812(412-417)Online publication date: 12-Mar-2012

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DATE '10: Proceedings of the Conference on Design, Automation and Test in Europe
March 2010
1868 pages
ISBN:9783981080162

Sponsors

  • EDAA: European Design Automation Association
  • ECSI
  • EDAC: Electronic Design Automation Consortium
  • SIGDA: ACM Special Interest Group on Design Automation
  • The IEEE Computer Society TTTC
  • The IEEE Computer Society DATC
  • The Russian Academy of Sciences: The Russian Academy of Sciences

Publisher

European Design and Automation Association

Leuven, Belgium

Publication History

Published: 08 March 2010

Check for updates

Qualifiers

  • Research-article

Conference

DATE '10
Sponsor:
  • EDAA
  • EDAC
  • SIGDA
  • The Russian Academy of Sciences
DATE '10: Design, Automation and Test in Europe
March 8 - 12, 2010
Germany, Dresden

Acceptance Rates

Overall Acceptance Rate 518 of 1,794 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2012)A new SBST algorithm for testing the register file of VLIW processorsProceedings of the Conference on Design, Automation and Test in Europe10.5555/2492708.2492812(412-417)Online publication date: 12-Mar-2012

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media