Analysis of Failure and Recovery Rates in a Wireless Telecommunications System

  • Authors:
  • Steven M. Matz;Lawrence G. Votta;Mohammad Malkawi

  • Affiliations:
  • -;-;-

  • Venue:
  • DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We derive estimates of mean time to failure and mean time to recover/repair for both hardware and software in a large wireless telecommunications system, based on six months of manually recorded outage data. The observed failure and recovery distributions are not consistent with simple exponential processes. The data can be described by Weibull or two-stage hyper-exponential distributed processes. The duration distributions for scheduled and unscheduled software outages have very different characteristics. The complex distributions observed may be the composition of simple independent processes which cannot be separated in this data set due to a lack of adequately detailed information or proper characterization of outage causes.In this system we found a coverage of \sim 98% for auto-recoveryfrom unscheduled software failures with an auto-repairfraction of \sim 36%.