Automated incident management for a platform-as-a-service cloud

  • Authors:
  • Soumitra Sarkar;Ruchi Mahindru;Rafah A. Hosn;Norbert Vogl;HariGovind V. Ramasamy

  • Affiliations:
  • IBM T. J. Watson Research Center, New York;IBM T. J. Watson Research Center, New York;IBM T. J. Watson Research Center, New York;IBM T. J. Watson Research Center, New York;IBM T. J. Watson Research Center, New York

  • Venue:
  • Hot-ICE'11 Proceedings of the 11th USENIX conference on Hot topics in management of internet, cloud, and enterprise networks and services
  • Year:
  • 2011

Abstract

Cloud-based offerings such as Infrastructure-as-a-Service (IaaS), Platform-as-a-Service (PaaS), and Software-as-a-Service (SaaS) are being delivered by various vendors at highly competitive prices to encourage a paradigm shift to utility computing. To optimize the operational costs of managing an IBM Cloud-based PaaS offering, a two-pronged approach has been adopted: simplification of the enterprise-class data center management processes currently used in IBM's Global Services Strategic Outsourcing accounts, and automation of the simplified processes. This paper describes a framework that the authors have developed to deliver an integrated monitoring and event correlation system, together with an event-driven Automated Incident Management System, for IBM's Smart Business Dev/Test Cloud offering.