Developing autonomic distributed scientific applications: a case study from history matching using ensemblekalman-filters

  • Authors:
  • Yaakoub El-Khamra;Shantenu Jha

  • Affiliations:
  • University of Texas Austin, Austin, Texas, USA & Louisiana State University, Baton Rouge, LA, USA;Louisiana State University, Baton Rouge, LA, USA & University of Edinburgh, United Kingdom

  • Venue:
  • GMAC '09 Proceedings of the 6th international conference industry session on Grids meets autonomic computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The development of a simple effective distributed applications that can utilize multiple distributed resources remains challenging. Therefore, not surprisingly, it is difficult to implement advanced application characteristics - such as autonomic behaviour for distributed applications. Notwithstanding, there exist a large class of applications which could benefit immensely with support for autonomic properties and behaviour. For example, many applications have irregular and highly variable resource requirements which are very difficult to predict in advance. As a consequence of irregular execution characteristics, dynamic resource requirements are difficult to predict a priori thus rendering static resource mapping techniques such as work flows ineffective; in general the resource utilization problem can be addressed more efficiently using autonomic approaches. This paper discusses the design and development of a prototype framework that can support many of the requirements of Autonomic applications that desire to use Computational Grids. We provide here an initial description of the features and the architecture of the Lazarus framework developed using SAGA, integrate it with an Ensemble Kalman Filter application, and demonstrate the advantages - performance and lower development cost, of the framework. As proof of concept we deploy Lazarus on several different machines on the TeraGrid, and show the effective utilization of several heterogeneous resources and distinct performance enhancements that autonomics provides. Careful analysis provides insight into the primary reason underlying the performance improvements, namely a late-binding and an optimal choice of the configuration of resources selected.