Gridifying IBM's Generic Log Adapter to Speed-Up the Processing of Log Data

  • Authors:
  • Claudi Paniagua;Fatos Xhafa;Thanasis Daradoumis

  • Affiliations:
  • IBM GTS, Spain;Universitat Politecnica de Catalunya, Spain;Open University of Catalonia, Spain

  • Venue:
  • CISIS '07 Proceedings of the First International Conference on Complex, Intelligent and Software Intensive Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Problem determination in today's computing environments consumes between 30 and 70% of an organization's IT resources and represents from one third to one half of their total cost of ownership. The first step to cutting down costs and to enable autonomic computing systems is to have all parts of the system report status in a common log data format and semantics. The Generic Log Adapter (GLA) is a generic parsing engine shipped with the IBM's Autonomic Computing Toolkit that has been conceived to convert proprietary log data into a standard log data event-based format in real time. However, in order to provide generic support for parsing the majority of today's unstructured log data formats the GLA makes heavy use of regular expressions that incur in performance limitations. Current approaches proposed to increase GLA's performance have revolved around fine-tuning the set of regular expressions used to configure the GLA for a particular log data format or writing specific parsing code. In this work we propose a very new approach consisting in transparently parallelizing the GLA by taking advantage of its internal architecture and the fact that structuring log data is a task that lends itself very well to parallelization. We present a master-worker strategy that "gridifies" the GLA efficiently in a completely transparent way for the user.