A dependability layer for large-scale distributed systems

  • Authors:
  • Valentin Cristea; C. Dobre; F. Pop; C. Stratan; A. Costan; C. Leordeanu; E. Tirsa

  • Affiliations:
  • Department of Computer Science, University Politehnica of Bucharest, Spl. Independentei 313, Bucharest, Romania (all authors)

  • Venue:
  • International Journal of Grid and Utility Computing
  • Year:
  • 2011

Abstract

Ensuring dependability in large-scale distributed systems is an important research subject today. Although many projects have obtained valuable results in this domain, no acceptable solution has yet been found that integrates all the requirements for designing a dependable system and exploits all the capabilities of modern systems. We present a unified, aggregate approach to ensuring the reliability, availability, safety and security of distributed systems. Starting from the proposed architecture, we present implementation details for two solutions designed to ensure fault tolerance, using virtualisation and container-based replication of services. We also present an approach to enhancing security by combining modern security models in large-scale distributed systems. The results and implementation details can serve as a methodology to assist distributed infrastructures in adopting such a middleware layer for enforcing dependability.