Failure resilient real-time data federation system

  • Authors:
  • Aakanksha Gagrani;Brijesh Pillai;Srikumar Krishnamoorthy

  • Affiliations:
  • Infosys Technologies Pvt. Ltd.;Infosys Technologies Pvt. Ltd.;Infosys Technologies Pvt. Ltd.

  • Venue:
  • SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data federation systems virtualize access to enterprize data resources by integrating data from disparate and heterogeneous operational data sources in an on-demand and real-time basis. The key challenge of low latency data access in such real-time data federation systems can be addressed by grid based scale-out architecture. However, failure of resources in the grid can pose serious challenges in data federation as the query processing is federated over multiple grid nodes. In such real-time data federation systems, it is often desirable to recover from failure and continue operation rather than repeat the entire process. This paper proposes a decentralized failure-recovery protocol for data federation system using data spaces based architecture. The generic nature of the protocol makes it extensible to applications other than data federation system as well. Moreover, the protocol does not make any assumptions about the availability of any central repository for recovering from failure. We implement the proposed failure recovery protocol in a simulation environment and present the key findings of the study.