Field testing for cosmic ray soft errors in semiconductor memories
IBM Journal of Research and Development - Special issue: terrestrial cosmic rays and soft errors
G4: A Fault-Tolerant CMOS Mainframe
FTCS '98 Proceedings of the The Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing
Custom S/390 G5 and G6 microprocessors
IBM Journal of Research and Development
Event monitoring in highly complex hardware systems
IBM Journal of Research and Development
Semi-hierarchical approach for reliability, availability, and serviceability of cellular systems
ACM SIGARCH Computer Architecture News
Run-control migration from single book to multibooks
IBM Journal of Research and Development
The z990 first error data capture concept
IBM Journal of Research and Development
DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Enhanced I/O subsystem recovery and availability on the IBM System z9
IBM Journal of Research and Development
Fully redundant clock generation and distribution with dynamic oscillator switchover
IBM Journal of Research and Development
Reducing planned outages for book hardware maintenance with concurrent book replacement
IBM Journal of Research and Development
IBM Journal of Research and Development
Practical software reuse for IBM System z I/O subsystems
IBM Journal of Research and Development
Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
IBM Journal of Research and Development
Concepts for Autonomous Control Flow Checking for Embedded CPUs
ATC '08 Proceedings of the 5th international conference on Autonomic and Trusted Computing
End-to-end register data-flow continuous self-test
Proceedings of the 36th annual international symposium on Computer architecture
Concepts for run-time and error-resilient control flow checking of embedded RISC CPUs
International Journal of Autonomous and Adaptive Communications Systems
RAS design for the IBM eServer z900
IBM Journal of Research and Development
The alternate support element, a high-availability service console for the IBM eServer z900
IBM Journal of Research and Development
A rapid prototyping system for error-resilient multi-processor systems-on-chip
Proceedings of the Conference on Design, Automation and Test in Europe
Unified resource manager virtualization management
IBM Journal of Research and Development
Journal of Electronic Testing: Theory and Applications
Hi-index | 0.00 |
The Reliability/Availability/Serviceability (RAS) strategy for S/390® G5 and G6 is to continue the S/390 objective of providing Continuous Reliable Operation (CRO). The RAS strategy is constructed with a set of building blocks which work closely together: error prevention, error detection, error recovery, problem determination, service structure, change management, and RAS measurement and analysis. The interdependency among the building blocks is such that removing or weakening any of them limits the ability of the design to achieve the overall CRO objective. Each building block must be fully implemented and must execute flawlessly within itself and together with the other blocks.