A Methodology for the Rapid Injection of Transient Hardware Errors

Authors:
Charles R. Yount;Daniel P. Siewiorek
Affiliations:
Intel Corp., Folsom, CA;Carnegie Mellon Univ., Pittsburgh, PA
Venue:
IEEE Transactions on Computers
Year:
1996

Citing 7
Cited 7

A measurement-based model for workload dependence of CPU errors

IEEE Transactions on Computers - The MIT Press scientific computation series
Fault Injection Experiments Using FIAT

IEEE Transactions on Computers
On the prediction of fault behavior based on workload

On the prediction of fault behavior based on workload
Reliable computer systems (2nd ed.): design and evaluation

Reliable computer systems (2nd ed.): design and evaluation
Observations on the Effects of Fault Manifestation as a Function of Workload

IEEE Transactions on Computers - Special issue on fault-tolerant computing
The automatic generation of instruction-level error manifestations of hardware faults: a new fault-injection model

The automatic generation of instruction-level error manifestations of hardware faults: a new fault-injection model
FOCUS: An Experimental Environment for Fault Sensitivity Analysis

IEEE Transactions on Computers

Validation-Based Development of Dependable Systems

IEEE Micro
Xception: A Technique for the Experimental Evaluation of Dependability in Modern Computers

IEEE Transactions on Software Engineering
A Dependability-Explicit Model for the Development of Computing Systems

SAFECOMP '00 Proceedings of the 19th International Conference on Computer Safety, Reliability and Security
From Experimental Assessment of Fault-Tolerant Systems to Dependability Benchmarking

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Comparison of Physical and Software-Implemented Fault Injection Techniques

IEEE Transactions on Computers
Reflections on Industry Trends and Experimental Research in Dependability

IEEE Transactions on Dependable and Secure Computing
Automatic generation of defect injectable VHDL fault models for ASIC standard cell libraries

Integration, the VLSI Journal

Quantified Score

Hi-index	14.98

Visualization

Abstract

Ultra-dependable computing demands verification of fault-tolerant mechanisms in the hardware. The most popular class of verification methodologies, fault-injection, is fraught with a host of limitations. Methods which are rapid enough to be feasible are not based on actual hardware faults. On the other hand, methods which are based on gate-level faults require enormous time resources. This research tries to bridge that gap by developing a new fault-injection methodology for processors based on a register-transfer-language (RTL) fault model. The fault model is developed by abstracting the effects of low-level faults to the RTL level. This process attempts to be independent of implementation details without sacrificing coverage, the proportion of errors generated by gate-level faults that are successfully reproduced by the RTL fault model. A prototype tool, ASPHALT, is described which automates the process of generating the error patterns. The IBM RISC-Oriented Micro-Processor (ROMP) is used as a basis for experimentation. Over 1.5 million transient faults are injected using a gate-level model. Over 97% of these are reproduced with the RTL model at a speedup factor of over 500:1. These results show that the RTL fault model may be used to greatly accelerate fault-injection experiments without sacrificing accuracy.