Neural networks and the bias/variance dilemma
Neural Computation
Optimal software rejuvenation for tolerating soft failures
Performance Evaluation
Analysis of Preventive Maintenance in Transactions Based Software Systems
IEEE Transactions on Computers
Statistical non-parametric algorithms to estimate the optimal software rejuvenation schedule
PRDC '00 Proceedings of the 2000 Pacific Rim International Symposium on Dependable Computing
Modeling and Analysis of Software Aging and Rejuvenation
SS '00 Proceedings of the 33rd Annual Simulation Symposium
A Methodology for Detection and Estimation of Software Aging
ISSRE '98 Proceedings of the The Ninth International Symposium on Software Reliability Engineering
A Measurement-Based Model for Estimation of Resource Exhaustion in Operational Software Systems
ISSRE '99 Proceedings of the 10th International Symposium on Software Reliability Engineering
An Approach for Estimation of Software Aging in a Web Server
ISESE '02 Proceedings of the 2002 International Symposium on Empirical Software Engineering
Software Rejuvenation: Analysis, Module and Applications
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Combining statistical monitoring and predictable recovery for self-management
WOSS '04 Proceedings of the 1st ACM SIGSOFT workshop on Self-managed systems
Proactive management of software aging
IBM Journal of Research and Development
A survey of online failure prediction methods
ACM Computing Surveys (CSUR)
Hi-index | 0.00 |
We propose a new paradigm for software availability enhancement. We offer a two-step strategy: Failure prediction followed by maintenance actions with the objective of avoiding impending failures or minimizing the effort of their repair. For the first step we present two failure prediction methods: universal basis functions (UBF) and similar events prediction (SEP), which are based on probabilistic analysis. The potential of the presented methods is evaluated by a case-study where failures of a commercial telecommunication platform have been predicted. The second step includes existing maintenance methods fitting the proposed approach and a new recovery strategy called “adaptive recovery blocks”. Since system availability enhancement is the overall goal, equations to calculate availability of such a system are given as well.