Stork: Making Data Placement a First Class Citizen in the Grid

  • Authors:
  • Tevfik Kosar;Miron Livny

  • Affiliations:
  • -;-

  • Venue:
  • ICDCS '04 Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS'04)
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Todays scientific applications have huge data requirements which continue to increase drastically every year. These data are generally accessed by many users from all across the the globe. This implies a major necessity to move huge amounts of data around wide area networks to complete the computation cycle, which brings with it the problem of efficient and reliable data placement. The current approach to solve this problem of data placement is either doing it manually, or employing simple scripts which do not have any automation or fault tolerance capabilities. Our goal is to make data placement activities first class citizens in the Grid just like the computational jobs. They will be queued, scheduled, monitored, managed, and even check-pointed. More importantly, it will be made sure that they complete successfully and without any human interaction. We also believe that data placement jobs should be treated differently from computational jobs, since they may have different semantics and different characteristics. For this purpose, we have developed Stork, a scheduler for dataplacement activities in the Grid.