Scalability of network-failure resilience: analysis using multi-layer probabilistic graphical models

  • Authors:
  • Guanglei Liu;Chuanyi Ji

  • Affiliations:
  • Department of Computer Science, Roane State Community College, Harriman, TN and School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA;School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA

  • Venue:
  • IEEE/ACM Transactions on Networking (TON)
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work, we quantify scalability of network re-silience upon failures. We characterize resilience as the percentage of lost traffic upon failures and define scalability as the growth rate of the percentage of lost traffic with respect to network size, link failure probability, and network traffic for given failure protection schemes. We apply probabilistic graphical models to characterize statistical dependence between physical-layer failures and the net-work-layer traffic, and analyze the scalability for large networks of different topologies. We first focus on the scalability of resilience for regular topolo-gies under uniform deterministic traffic with independent and de-pendent link failures, with and without protection. For large net-works with small probabilities of failures and without protection, we show that the scalability of network resilience grows linearly with the average route length and with the "effective" link failure probability. For large networks with 1 + 1 protection, we obtain lower and upper bound of the percentage of lost traffic. We de-rive approximations of the scalability for arbitrary topologies, and attain close-form analytical results for ring, star, and mesh-torus topologies. We then study network resilience under random traffic with Poisson arrivals. We find that when the network is under light load, the network resilience is reduced to that under uniform de-terministic traffic. When the network load is under heavy load, the percentage of lost traffic approaches the marginal probability of link failure. Our scalability analysis shows explicitly how network resilience varies with different factors and provides insights for re-silient network design.