Scheduling and simulation: how to upgrade distributed systems

  • Authors:
  • Sameer Ajmani;Barbara Liskov;Liuba Shrira

  • Affiliations:
  • Massachusetts Institute of Technology, Laboratory for Computer Science;Massachusetts Institute of Technology, Laboratory for Computer Science;Brandeis University, Computer Science Department

  • Venue:
  • HOTOS'03 Proceedings of the 9th conference on Hot Topics in Operating Systems - Volume 9
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Upgrading the software of long-lived distributed systems is difficult. It is not possible to upgrade all the nodes in a system at once, since some nodes may be down and halting the system for an upgrade is unacceptable. This means that different nodes may be running different software versions and yet need to communicate, even though those versions may not be fully compatible. We present a methodology and infrastructure that addresses these challenges and makes it possible to upgrade distributed systems automatically while limiting service disruption.