Self-Management of systems through automatic restart

  • Authors:
  • Katinka Wolter

  • Affiliations:
  • Institut für Informatik, Humboldt-Universität zu Berlin, Berlin, Germany

  • Venue:
  • Self-star Properties in Complex Information Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern complex information systems require management mechanisms that operate to a large extent independently and autonomously. One such mechanism is the restart of components or transactions in case a failure in the system occurs. In this paper we introduce a pragmatic algorithm to determine close to optimal restart times on-line. We present a method for choosing best restart times based on empirical data, if no theoretical distribution is known. The best restart time is determined based on the empirical hazard rate. We study the sample size required to come to a reasonably good estimate, the effect of the failure probability of a job and issues of parameter selection for the hazard rate estimation. The application considered in this paper is the connection setup time in HTTP GET necessary for the download of web pages.