Characterization of failures in an operational IP backbone network

  • Authors:
  • Athina Markopoulou;Gianluca Iannaccone;Supratik Bhattacharyya;Chen-Nee Chuah;Yashar Ganjali;Christophe Diot

  • Affiliations:
  • Department of Electrical Engineering and Computer Science, University of California at Irvine, Irvine, CA;Intel Research, Berkeley, CA;Snaptell Inc., Mountain View, CA;Department of Electrical and Computer Engineering, University of California at Davis, Davis, CA;Department of Computer Science, University of Toronto, Bahen Center for Information Technology, Toronto, ON, Canada;Thomson R&D, Paris, France

  • Venue:
  • IEEE/ACM Transactions on Networking (TON)
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

As the Internet evolves into a ubiquitous communication infrastructure and supports increasingly important services, its dependability in the presence of various failures becomes critical. In this paper, we analyze IS-IS routing updates fromthe Sprint IP backbone network to characterize failures that affect IP connectivity. Failures are first classified based on patterns observed at the IP-layer; in some cases, it is possible to further infer their probable causes, such as maintenance activities, router-related and optical layer problems. Key temporal and spatial characteristics of each class are analyzed and, when appropriate, parameterized using well-known distributions. Our results indicate that 20% of all failures happen during a period of scheduled maintenance activities. Of the unplanned failures, almost 30% are shared by multiple links and are most likely due to router-related and optical equipment-related problems, respectively, while 70% affect a single link at a time. Our classification of failures reveals the nature and extent of failures in the Sprint IP backbone. Furthermore, our characterization of the different classes provides a probabilistic failure model, which can be used to generate realistic failure scenarios, as input to various network design and traffic engineering problems.