Fault tolerant and fault testable hardware design
Fault tolerant and fault testable hardware design
A measurement-based model for workload dependence of CPU errors
IEEE Transactions on Computers - The MIT Press scientific computation series
Fault Injection for Dependability Validation: A Methodology and Some Applications
IEEE Transactions on Software Engineering
FINE: A Fault Injection and Monitoring Environment for Tracing the UNIX System Behavior Under Faults
IEEE Transactions on Software Engineering - Special issue on software reliability
The POWER2 performance monitor
IBM Journal of Research and Development
Estimators for Fault Tolerance Coverage Evaluation
IEEE Transactions on Computers - Special issue on fault-tolerant computing
A Methodology for the Rapid Injection of Transient Hardware Errors
IEEE Transactions on Computers
Computer
Fault Injection Techniques and Tools
Computer
FOCUS: An Experimental Environment for Fault Sensitivity Analysis
IEEE Transactions on Computers
RIFLE: A General Purpose Pin-level Fault Injector
EDCC-1 Proceedings of the First European Dependable Computing Conference on Dependable Computing
Experimental assessment of parallel systems
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Generation of an error set that emulates software faults based on field data
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
An approach towards benchmarking of fault-tolerant commercial systems
FTCS '96 Proceedings of the The Twenty-Sixth Annual International Symposium on Fault-Tolerant Computing (FTCS '96)
Dependability evaluation using hybrid fault/error injection
IPDS '95 Proceedings of the International Computer Performance and Dependability Symposium on Computer Performance and Dependability Symposium
Assessing the effects of communication faults on parallel applications
IPDS '95 Proceedings of the International Computer Performance and Dependability Symposium on Computer Performance and Dependability Symposium
FTCS'95 Proceedings of the Twenty-Fifth international conference on Fault-tolerant computing
GUARDS: A Generic Upgradable Architecture for Real-Time Dependable Systems
IEEE Transactions on Parallel and Distributed Systems
Teraflops Supercomputer: Architecture and Validation of the Fault Tolerance Mechanisms
IEEE Transactions on Computers
The Design and Verification of the Rio File Cache
IEEE Transactions on Computers
Dependability of COTS Microkernel-Based Systems
IEEE Transactions on Computers - Special issue on fault-tolerant embedded systems
PROPANE: an environment for examining the propagation of errors in software
ISSTA '02 Proceedings of the 2002 ACM SIGSOFT international symposium on Software testing and analysis
Fault-Detection by Result-Checking for the Eigenproblem
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
Can Software Implemented Fault-Injection Be Used on Real-Time Systems?
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
MAFALDA: Microkernel Assessment by Fault Injection and Design Aid
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
Assessing Error Detection Coverage by Simulated Fault Injection
EDCC-3 Proceedings of the Third European Dependable Computing Conference on Dependable Computing
EDCC-4 Proceedings of the 4th European Dependable Computing Conference on Dependable Computing
Dependability of Computer Control Systems in Power Plants
SAFECOMP '00 Proceedings of the 19th International Conference on Computer Safety, Reliability and Security
Experimental Evaluation of Fault Handling Mechanisms
SAFECOMP '01 Proceedings of the 20th International Conference on Computer Safety, Reliability and Security
Performance Evaluation of Exception Handling in I/O Libraries
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Software-Implemented Fault Injection Methodology for Design and Validation of System Fault Tolerance
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
GOOFI: Generic Object-Oriented Fault Injection Tool
DSN '01 Proceedings of the 2001 International Conference on Dependable Systems and Networks (formerly: FTCS)
Comparison of Physical and Software-Implemented Fault Injection Techniques
IEEE Transactions on Computers
Automatic Detection and Masking of Nonatomic Exception Handling
IEEE Transactions on Software Engineering
Reflections on Industry Trends and Experimental Research in Dependability
IEEE Transactions on Dependable and Secure Computing
Software faults: a quantifiable definition
Advances in Engineering Software
Automatic generation of defect injectable VHDL fault models for ASIC standard cell libraries
Integration, the VLSI Journal
Virtual framework for testing the reliability of system software on embedded systems
Proceedings of the 2007 ACM symposium on Applied computing
Experiences in measuring the reliability of a cache-based storage system
WIESS'00 Proceedings of the 1st conference on Industrial Experiences with Systems Software - Volume 1
Study of the Effects of SEU-Induced Faults on a Pipeline Protected Microprocessor
IEEE Transactions on Computers
Information Assurance: Dependability and Security in Networked Systems
Information Assurance: Dependability and Security in Networked Systems
Exhaustif®: a fault injection tool for distributed heterogeneous embedded systems
EATIS '07 Proceedings of the 2007 Euro American conference on Telematics and information systems
Case-based software reliability assessmentby fault injection unified procedures
Proceedings of the 2008 international workshop on Software Engineering in east and south europe
Fault injection framework for system resilience evaluation: fake faults for finding future failures
Proceedings of the 2009 workshop on Resiliency in high performance
Software faults: A quantifiable definition
Advances in Engineering Software
Exhaustive testing of exception handlers with enforcer
FMCO'06 Proceedings of the 5th international conference on Formal methods for components and objects
Dependability metrics
An effective method to control interrupt handler for data race detection
Proceedings of the 5th Workshop on Automation of Software Test
XML schema based fault set definition to improve fault injection tools interoperability
International Journal of Critical Computer-Based Systems
End-to-end data integrity for file systems: a ZFS case study
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
A realistic evaluation of memory hardware errors and software system susceptibility
USENIXATC'10 Proceedings of the 2010 USENIX conference on USENIX annual technical conference
MODIFI: a MODel-implemented fault injection tool
SAFECOMP'10 Proceedings of the 29th international conference on Computer safety, reliability, and security
Fault injection into Verilog models for dependability evaluation of digital systems
ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
How to advance TPC benchmarks with dependability aspects
TPCTC'10 Proceedings of the Second TPC technology conference on Performance evaluation, measurement and characterization of complex systems
The impact of fault models on software robustness evaluations
Proceedings of the 33rd International Conference on Software Engineering
Impact of faults in combinational logic of commercial microcontrollers
EDCC'05 Proceedings of the 5th European conference on Dependable Computing
On the effects of errors during boot
LADC'05 Proceedings of the Second Latin-American conference on Dependable Computing
A case for virtual machine based fault injection in a high-performance computing environment
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Failure boundedness in discrete applications
LADC'07 Proceedings of the Third Latin-American conference on Dependable Computing
SAFECOMP'12 Proceedings of the 31st international conference on Computer Safety, Reliability, and Security
Using unreliable virtual hardware to inject errors in extreme-scale systems
Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale
Journal of Systems and Software
Combining Fault-Injection with Property-Based Testing
Proceedings of International Workshop on Engineering Simulations for Cyber-Physical Systems
Hi-index | 0.02 |
An important step in the development of dependable systems is the validation of their fault tolerance properties. Fault injection has been widely used for this purpose, however with the rapid increase in processor complexity, traditional techniques are also increasingly more difficult to apply. This paper presents a new software implemented fault injection and monitoring environment, called Xception, which is targeted for the modern and complex processors. Xception uses the advanced debugging and performance monitoring features existing in most of the modern processors to inject quite realistic faults by software, and to monitor the activation of the faults and their impact on the target system behavior in detail. Faults are injected with minimum interference with the target application. The target application is not modified, no software traps are inserted, and it is not necessary to execute the target application in special trace mode (the application is executed at full speed). Xception provides a comprehensive set of fault triggers, including spatial and temporal fault triggers, and triggers related to the manipulation of data in memory. Faults injected by Xception can affect any process running on the target system (including the kernel), and it is possible to inject faults in applications for which the source code is not available. Experimental results are presented to demonstrate the accuracy and potential of Xception in the evaluation of the dependability properties of the complex computer systems available nowadays.