One of the basic services in grids is the transfer of data between remote machines. Files may be transferred at the explicit request of the user or as part of delegated resource management services, such as data replication or job scheduling. GridFTP is an important tool for such data transfers: it builds on the common FTP protocol, has a large user base with multiple implementations, and uses the GSI security model, which allows delegated operations. This paper presents a workload analysis of the GridFTP implementation provided by the Globus Toolkit. We studied more than 1.5 years of traces reported by installed Globus GridFTP components from all over the world. Our study focuses on three dimensions. First, it quantifies the volume of data transferred and characterizes user behavior. Second, it attempts to show how tuning capabilities are used in practice. Finally, it quantifies the user base as recorded in the database and highlights the usage trends of this software component.