AHAFS subsystem for enhancing operating system health in the cloud computing era

  • Authors:
  • J. Jann;N. Dubey;R. S. Burugula;P. Pattnaik

  • Affiliations:
  • IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY;IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY

  • Venue:
  • IBM Journal of Research and Development
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cloud computing is becoming an important paradigm for information technology (IT) infrastructures. A key advantage of cloud computing is its ability to decrease the cost of computing for a variety of applications. For the infrastructure layer, which includes servers and operating systems (OSs), cost reduction is achieved through the consolidation of many OS instances onto a single physical server and through significant improvements in system-administrator productivity. However, these productivity improvements require a scalable easy-to-use OS event-management subsystem so that the higher level software can reliably and effectively automate operations. This paper presents the design and implementation of an OS event-monitoring subsystem called Autonomic Health Advisor FileSystem (AHAFS) that is scalable, has little system-performance overhead, and provides instantaneous event notifications with useful information, for improving the robustness, security, and performance of an OS instance. AHAFS is extensible and can monitor various events for a wide variety of event consumers. AHAFS does not require the use of a new application programming interface (API) but uses the ubiquitous filesystem API; hence, AHAFS is usable by monitoring applications written in any of the languages commonly used for monitoring (e.g., C, C++, Perl®, and Java®) without requiring additional runtime modules or packages.