Measuring reliability of computation center software
ICSE '78 Proceedings of the 3rd international conference on Software engineering
A Statistical Failure/Load Relationship: Results of a Multicomputer Study
IEEE Transactions on Computers
IEEE Transactions on Software Engineering
The evolution of the MVS operating system
IBM Journal of Research and Development
Measurement-based Analysis of Networked System Availability
Performance Evaluation: Origins and Directions
Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data
Performance Evaluation of Complex Systems: Techniques and Tools, Performance 2002, Tutorial Lectures
Failure Data Analysis of a LAN of Windows NT Based Computers
SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
FTCS'95 Proceedings of the Twenty-Fifth international conference on Fault-tolerant computing
Hi-index | 14.98 |
This paper describes an analysis of system detected software errors on the MVS operating system at the Center for Information Technology (CIT), Stanford University. The analysis procedure demonstrates a methodology by which systems with automatic recovery features can be evaluated. Most common error categories are determined and related to the program in execution at the time of the error. The severity of the error is measured by evaluating the criticality of the program for continued system operation. The system recovery and error correction features are then analyzed and an estimate of the system fault tolerance to errors of different levels of severity is made.