Data transfers in the grid: workload analysis of globus GridFTP

  • Authors:
  • Nicolas Kourtellis;Lydia Prieto;Adriana Iamnitchi;Gustavo Zarrate;Dan Fraser

  • Affiliations:
  • University of South Florida, Tampa, FL, USA;University of South Florida, Tampa, FL, USA;University of South Florida, Tampa, FL, USA;University of South Florida, Tampa, FL, USA;Argonne National Laboratory, Chicago, IL, USA

  • Venue:
  • DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the basic services in grids is the transfer of data between remote machines. Files may be transferred at the explicit request of the user or as part of delegated resource management services, such as data replication or job scheduling. GridFTP is an important tool for such data transfers since it builds on the common FTP protocol, has a large user base with multiple implementations, and it uses the GSI security model that allows delegated operations. This paper presents a workload analysis of the implementation of the GridFTP protocol provided by the Globus Toolkit. We studied more than 1.5 years of traces reported from all over the world by Globus GridFTP installed components. Our study focuses on three dimensions: first, it quantifies the volume of data transferred and characterizes user behavior. Second, it attempts to show how tuning capabilities are used in practice. Finally, it quantifies the user base as recorded in the database and highlights the usage trends of this software component.