The authors present a fault injection and monitoring environment (FINE) as a tool to study fault propagation in the UNIX kernel. FINE injects hardware-induced software errors and software faults into the UNIX kernel and traces the execution flow and key variables of the kernel. FINE consists of a fault injector, a software monitor, a workload generator, a controller, and several analysis utilities. Experiments on SunOS 4.1.2 are conducted by applying FINE to investigate fault propagation and to evaluate the impact of various types of faults. Fault propagation models are built for both hardware and software faults. Transient Markov reward analysis is performed to evaluate the loss of performance due to an injected fault. Experimental results show that memory and software faults usually have a very long latency, while bus and CPU faults tend to crash the system immediately. About half of the detected errors are data faults, which are detected when the system tries to access an unauthorized memory location. Only about 8% of faults propagate to other UNIX subsystems. Markov reward analysis shows that the performance loss incurred by bus faults and CPU faults is much higher than that incurred by software and memory faults. Among software faults, the impact of pointer faults is higher than that of nonpointer faults.
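The abstract does not give the structure of the reward model, so the following is only a minimal sketch of the general idea behind transient Markov reward analysis, not the authors' model. It assumes a discrete-time chain with three hypothetical states (`latent`, `degraded`, `crashed`), invented transition probabilities, and invented per-step reward rates. The expected reward accumulated over a finite horizon is compared with the reward a fault-free system would earn, and the difference is read as the performance loss.

```python
# Illustrative sketch only: states, probabilities, and rewards are
# hypothetical and are NOT taken from the FINE experiments.

def step(dist, P):
    """Advance a state distribution by one step of the chain P."""
    n = len(P)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]

def expected_accumulated_reward(initial, P, rewards, steps):
    """Sum the expected reward earned in each of the first `steps` steps."""
    dist = list(initial)
    total = 0.0
    for _ in range(steps):
        total += sum(p * r for p, r in zip(dist, rewards))
        dist = step(dist, P)
    return total

def performance_loss(initial, P, rewards, steps, fault_free_rate=1.0):
    """Reward a fault-free system would earn minus the expected reward."""
    earned = expected_accumulated_reward(initial, P, rewards, steps)
    return fault_free_rate * steps - earned

# Hypothetical model: a fault has just been injected and is still latent.
STATES = ["latent", "degraded", "crashed"]
P = [
    [0.90, 0.08, 0.02],  # latent: mostly stays latent
    [0.00, 0.70, 0.30],  # degraded: may crash
    [0.00, 0.00, 1.00],  # crashed: absorbing
]
REWARDS = [1.0, 0.5, 0.0]  # useful work done per step in each state
INITIAL = [1.0, 0.0, 0.0]

if __name__ == "__main__":
    for horizon in (10, 50, 100):
        loss = performance_loss(INITIAL, P, REWARDS, horizon)
        print(f"horizon={horizon:3d} steps, expected loss={loss:.2f}")
```

Under these assumptions, a fault class that moves quickly to `crashed` accumulates loss faster than one that stays `latent` for a long time, which is the kind of comparison the abstract reports between bus or CPU faults and memory or software faults.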