Software-Based Hardware Fault Tolerance for Many-Core Architectures
Page 223
Abstract
Software-based hardware fault tolerance describes a class of techniques which allows software to detect and correct errors introduced by unreliable hardware. With the advent of many-core architectures, the already existing reliability issues, like temporal and structural variations or the sensitivity against soft-errors, are becoming an even more serious problem. Software-based hardware fault tolerance is able to provide cost-effective solutions. This presentation will point out the new opportunities and challenges for applying software-based hardware fault tolerance to emerging many-core architectures. We will discuss the tradeoff between the application of these techniques and the classical hardware-based fault tolerance in terms of fault coverage, overhead, and performance.
Index Terms
- Software-Based Hardware Fault Tolerance for Many-Core Architectures
Recommendations
Hardware/software fault tolerance with multiple task modular redundancy
ISCC '95: Proceedings of the IEEE Symposium on Computers and Communications (ISCC'95)N-modular redundancy (NMR) and N-version programming (NVP) are two popular fault tolerance techniques in which hardware and software redundancy is exploited to mask faults. Redundant hardware is used to improve fault tolerance rather than throughput. We ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
October 2009
400 pages
ISBN:9780769538396
Publisher
IEEE Computer Society
United States
Publication History
Published: 07 October 2009
Author Tags
Qualifiers
- Article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Reflects downloads up to 23 Nov 2024