A Centralized Data Access Model for Grid Computing

  • Authors:
  • Phil Andrews;Tom Sherwin;Bryan Banister

  • Affiliations:
  • -;-;-

  • Venue:
  • MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Global access to storage is a common theme of Grid Computing, with access mechanisms often enforcing a major restriction on the distribution of significant applications across a computational grid. The established approach is to distribute the data with the jobs, sometimesrequiring lengthy delays on job completion and the necessity for significant resource discovery to establish local data capabilities. For applications that are truly data intensive, this may render them either highly inefficient, or even incapable of using grid computing environments.In this paper we describe a different approach, where the opportunity to design a grid environment from scratch is used to build a tightly-coupled, data-oriented infrastructure that leverages deep investment in leading edge technology to provide very high-speed, widespread access to large data storage. Results from a geographically distributed Grid established for the Supercomputing 2002 conference, using preliminary TeraGrid infrastructure, are included and show encouraging performance including data transfer rates of over 700 MB/s using eight 1 Gb/s links from a Storage Area Network to a 10 Gb/s Wide Area Network