CoRAL: A transparent fault-tolerant web service

  • Authors:
  • Navid Aghdaie;Yuval Tamir

  • Affiliations:
  • Concurrent Systems Laboratory, UCLA Computer Science Department, Los Angeles, CA 90095, United States;Concurrent Systems Laboratory, UCLA Computer Science Department, Los Angeles, CA 90095, United States

  • Venue:
  • Journal of Systems and Software
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Web is increasingly used for critical applications and services. We present a client-transparent mechanism, called CoRAL, that provides high reliability and availability for Web service. CoRAL provides fault tolerance even for requests being processed at the time of server failure. The scheme does not require deterministic servers and can thus handle dynamic content. CoRAL actively replicates the TCP connection state while maintaining logs of HTTP requests and replies. In the event of a primary server failure, active client connections fail over to a spare, where their processing continues seamlessly. We describe key aspects of the design and implementation as well as several performance optimizations. Measurements of system overhead, failover performance, and preliminary validation using fault injection are presented.