On the Design, Development, Deployment, and Network Survivability Analysis of the Dynamic Routing System Protocol

  • Authors:
  • Abdur Chowdhury;Ophir Frieder;Peng-Jun Wan

  • Affiliations:
  • Department of Computer Science, Illinois Institute of Technology abdur@cs.iit.edu;Department of Computer Science, Illinois Institute of Technology ophir@cs.iit.edu;Department of Computer Science, Illinois Institute of Technology wan@cs.iit.edu

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the ever-increasing demands on server applications, reliability is of paramount importance. Often these services are implemented using a distributed server cluster architecture where many servers act together providing end user services. We evaluated one hundred deployed systems and found that over a one-year period, thirteen percent of the hardware failures were network related. To reliably provide end-user services, the server clusters must guarantee server-to-server communication in the presence of these network failures. We describe a protocol designed to provide proactive dynamic routing for server clusters architectures called the Dynamic Routing System (DRS) protocol and present analysis to its survivability in the presence of network failure. Our experiments show that, for an eight-node server cluster with three concurrent network failures, the DRS provides a 267% improvement in the probability of server to server communication over a traditional network topology. Additionally, the proactive routing approach of the DRS performs better than traditional routing systems by fixing network problems before they affect application communication.