This paper presents a classification of file placement and replication methods on grids. The study is motivated by file transfer issues encountered in the Virtual Imaging Platform deployed on the European Grid Infrastructure. Approaches proposed in the last 6 years are classified using taxonomies of replication process, replication optimization, file models, resource models and replication validation. Most existing approaches implement file replication as a middleware service, using dynamic strategies. Production approaches differ slightly from works evaluated in simulation or under controlled conditions, which (i) mostly assume simplistic file models (undistinguished read-only files), (ii) rely on elaborate access patterns, (iii) assume clairvoyance of the infrastructure parameters and (iv) study file availability less than other metrics while emphasizing cost.