Applying Database Support for Large Scale Data Driven Science in Distributed Environments

  • Authors:
  • Sivaramakrishnan Narayanan;Umit Catalyurek;Tahsin Kurc;Xi Zhang;Joel Saltz

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • GRID '03 Proceedings of the 4th International Workshop on Grid Computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

There is a rapidly growing set of applications, referredto as data driven applications, in which analysis of largeamounts of data drives the next steps taken by the scientist,e.g., running new simulations, doing additional measurements,extending the analysis to larger data collections.Critical steps in data analysis are to extract the data of interestfrom large and potentially distributed datasets andto move it from storage clusters to compute clusters forprocessing. We have developed a middleware framework,called GridDB-Lite, that is designed to efficiently supportthese two steps. In this paper, we describe the applicationof GridDB-Lite in large scale oil reservoir simulation studiesand experimentally evaluate several optimizations thatcan be employed in the GridDB-Lite runtime system.