On a scheduling problem where a job can be executed only by a limited number of processors
Computers and Operations Research
The grid: blueprint for a new computing infrastructure
The grid: blueprint for a new computing infrastructure
File and Object Replication in Data Grids
Cluster Computing
Allocating Data and Operations to Nodes in Distributed Database Design
IEEE Transactions on Knowledge and Data Engineering
Replica Selection in the Globus Data Grid
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
The Globus Project: A Status Report
HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
Predicting Sporadic Grid Data Transfers
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
A Decentralized, Adaptive Replica Location Mechanism
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
Predicting the Performance of Wide Area Data Transfers
IPDPS '02 Proceedings of the 16th International Symposium on Parallel and Distributed Processing
Enabling the Co-Allocation of Grid Data Transfers
GRID '03 Proceedings of the 4th International Workshop on Grid Computing
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
Replica selection on co-allocation data grids
ISPA'04 Proceedings of the Second international conference on Parallel and Distributed Processing and Applications
An on-line replication strategy to increase availability in Data Grids
Future Generation Computer Systems
Efficient reuse of replicated parallel data segments in computational grids
Future Generation Computer Systems
A dynamic data replication strategy using access-weights in data grids
The Journal of Supercomputing
Future Generation Computer Systems
Memory-Mapped File Approach for On-Demand Data Co-allocation on Grids
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
A DSM-based fragmented data sharing framework for grids
Future Generation Computer Systems
Group-based self-organization grid architecture
GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
Transparent on-demand co-allocation data access for grids
International Journal of Ad Hoc and Ubiquitous Computing
FIRE: A File Reunion Based Data Replication Strategy for Data Grids
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Dynamic data replication strategy based on federation data grid systems
ICICA'10 Proceedings of the First international conference on Information computing and applications
Improving job scheduling performance with parallel access to replicas in Data Grid environment
The Journal of Supercomputing
Dynamic QoS-aware data replication in grid environments based on data "importance"
Future Generation Computer Systems
PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids
Future Generation Computer Systems
Network coding for creating replica in grid environments
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Efficient Management of Fragmented Replica in Data Grids
International Journal of Grid and High Performance Computing
Decreasing power consumption with energy efficient data aware strategies
Future Generation Computer Systems
Hi-index | 0.00 |
Data Grids support data-intensive applications in wide area Grid systems. They utilize local storage systems as distributed data stores by replicating datasets. Replication is a commonly used technique in a distributed environment. The motivation of replication is that replication can improve data availability, data access performance, and load balancing. Usually a complete file is copied to many Grid sites for local access. However, a site may only need parts of a replica. Therefore, to use the storage systems efficiently, it is necessary for a Grid site to store only parts of a replica. In this paper, we propose a concept called fragmented replicas. That is, when doing replication, a site can store only some partial contents needed locally. It can greatly save the storage space wasted in storing unused data. We also propose a block mapping procedure to determine the distribution of blocks in every available server for later replica retrieval. According to this procedure, a server can provide its available partial replica contents for other members in the Grid system to access. On the other hand, a client can retrieve a fragmented replica directly by using the block mapping procedure. After the block mapping procedure, some co-allocation schemes can be used to retrieve data sets from the available servers. The simulation shows that the co-allocation schemes also improve download performance in a fragmented replication system.