A framework for self-optimizing, fault-tolerant, high performance bulk data transfers in a heterogeneous grid environment

  • Authors:
  • Tevfik Kosar;George Kola;Miron Livny

  • Affiliations:
  • Computer Sciences Department, University of Wisconsin-Madison, Madison WI;Computer Sciences Department, University of Wisconsin-Madison, Madison WI;Computer Sciences Department, University of Wisconsin-Madison, Madison WI

  • Venue:
  • ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
  • Year:
  • 2003


Abstract

The drastic increase in the data requirements of scientific applications, combined with a growing trend towards collaborative research, has resulted in the need to transfer large amounts of data among participating sites. The general approach to transferring such large amounts of data has been either to dump the data to tapes and mail them, or to employ scripts with an operator at each site who babysits the transfers and deals with failures. We introduce a framework that automates the whole process of data movement between different sites. The framework does not require any human intervention, and it can recover automatically from various kinds of storage system, network, and software failures, guaranteeing completion of the transfers. The framework has sophisticated monitoring and tuning capability that increases the performance of the data transfers on the fly. It also generates on-the-fly visualization of the transfers, making it simple to identify problems and bottlenecks in the system.
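
The abstract does not describe how the framework recovers from failures. As a purely illustrative sketch of the general idea of unattended recovery, the Python below retries a failed transfer with exponential backoff. The names `TransferError` and `transfer_with_retry`, the backoff policy, and the attempt limit are all assumptions made for this example and are not taken from the paper.

```python
import time
from typing import Callable


class TransferError(Exception):
    """Hypothetical error standing in for a storage, network, or software failure."""


def transfer_with_retry(
    transfer: Callable[[], None],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Call transfer() until it succeeds or max_attempts is exhausted.

    Returns the number of attempts that were needed. Between attempts it
    waits base_delay * 2**(attempt - 1) seconds (assumed backoff policy).
    """
    for attempt in range(1, max_attempts + 1):
        try:
            transfer()
            return attempt
        except TransferError:
            if attempt == max_attempts:
                # Out of attempts: surface the failure to the caller.
                raise
            sleep(base_delay * 2 ** (attempt - 1))
    raise ValueError("max_attempts must be at least 1")
```

A bounded retry loop like this cannot by itself guarantee completion. It only shows how a failed transfer might be reattempted without an operator watching it.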