DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks

  • Authors: Alex Sim, Junmin Gu, Arie Shoshani, Vijaya Natarajan

  • Affiliations: Lawrence Berkeley National Laboratory (all authors)

  • Venue: SSDBM '04: Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year: 2004

Abstract

Typically, large scientific datasets (order of terabytes) are generated at large computational centers, and stored on mass storage systems. However, large subsets of the data need to be moved to facilities available to application scientists for analysis. File replication of thousands of files is a tedious, error-prone, but extremely important task in scientific applications. The automation of the file replication task requires automatic space acquisition and reuse, and monitoring the progress of staging thousands of files from the source mass storage system, transferring them over the network, archiving them at the target mass storage system or disk systems, and recovering from transient system failures. We have developed a robust replication system, called DataMover, which is now in regular use in High-Energy-Physics and Climate modeling experiments. Only a single command is necessary to request multi-file replication or the replication of an entire directory. A web-based tool was developed to dynamically monitor the progress of the multi-file replication process.
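
The stage, transfer, archive, and recover sequence described in the abstract can be illustrated with a minimal Python sketch. This is not DataMover's implementation or interface: the names `TransientError`, `ReplicationReport`, `with_retries`, and `replicate` are hypothetical, the three step functions are supplied by the caller, and space acquisition and the web-based monitor are left out. It only shows one plausible way to retry each step per file and record progress.

```python
import time
from dataclasses import dataclass, field


class TransientError(Exception):
    """Hypothetical marker for a failure that is worth retrying."""


@dataclass
class ReplicationReport:
    """Hypothetical progress record: which files finished, which gave up."""
    done: list = field(default_factory=list)
    failed: list = field(default_factory=list)


def with_retries(step, name, attempts=3, delay=0.0):
    """Call step(name), retrying on TransientError up to `attempts` times."""
    for attempt in range(1, attempts + 1):
        try:
            return step(name)
        except TransientError:
            if attempt == attempts:
                raise  # retries exhausted; let the caller record the failure
            time.sleep(delay)


def replicate(files, stage, transfer, archive, attempts=3, delay=0.0):
    """Run stage -> transfer -> archive for every file, one file at a time.

    A file whose step keeps failing is recorded and skipped, so one bad
    file does not stop the rest of the request.
    """
    report = ReplicationReport()
    for name in files:
        try:
            for step in (stage, transfer, archive):
                with_retries(step, name, attempts, delay)
            report.done.append(name)
        except TransientError:
            report.failed.append(name)
    return report
```

A caller would pass its own functions for the three steps (for example, wrappers around whatever staging and transfer tools a site uses) and inspect `report.done` and `report.failed` afterwards. A production system would additionally need concurrency, space management, and persistent state, none of which this sketch attempts.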