Complete and fragmented replica selection and retrieval in Data Grids

Authors:
Ruay-Shiung Chang;Po-Hung Chen
Affiliations:
Department of Computer Science and Information Engineering, National Dong Hwa University, Shoufeng, Hualien, 974, Taiwan;Department of Computer Science and Information Engineering, National Dong Hwa University, Shoufeng, Hualien, 974, Taiwan
Venue:
Future Generation Computer Systems
Year:
2007

Citing 12
Cited 16

On a scheduling problem where a job can be executed only by a limited number of processors

Computers and Operations Research
The grid: blueprint for a new computing infrastructure

The grid: blueprint for a new computing infrastructure
File and Object Replication in Data Grids

Cluster Computing
Allocating Data and Operations to Nodes in Distributed Database Design

IEEE Transactions on Knowledge and Data Engineering
Replica Selection in the Globus Data Grid

CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
The Globus Project: A Status Report

HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
Predicting Sporadic Grid Data Transfers

HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
A Decentralized, Adaptive Replica Location Mechanism

HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
Predicting the Performance of Wide Area Data Transfers

IPDPS '02 Proceedings of the 16th International Symposium on Parallel and Distributed Processing
Enabling the Co-Allocation of Grid Data Transfers

GRID '03 Proceedings of the 4th International Workshop on Grid Computing
The Anatomy of the Grid: Enabling Scalable Virtual Organizations

International Journal of High Performance Computing Applications
Replica selection on co-allocation data grids

ISPA'04 Proceedings of the Second international conference on Parallel and Distributed Processing and Applications

An on-line replication strategy to increase availability in Data Grids

Future Generation Computer Systems
Efficient reuse of replicated parallel data segments in computational grids

Future Generation Computer Systems
A dynamic data replication strategy using access-weights in data grids

The Journal of Supercomputing
Ontology-based content organization and retrieval for SCORM-compliant teaching materials in data grids

Future Generation Computer Systems
Memory-Mapped File Approach for On-Demand Data Co-allocation on Grids

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
A DSM-based fragmented data sharing framework for grids

Future Generation Computer Systems
Group-based self-organization grid architecture

GPC'07 Proceedings of the 2nd international conference on Advances in grid and pervasive computing
Transparent on-demand co-allocation data access for grids

International Journal of Ad Hoc and Ubiquitous Computing
FIRE: A File Reunion Based Data Replication Strategy for Data Grids

CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Dynamic data replication strategy based on federation data grid systems

ICICA'10 Proceedings of the First international conference on Information computing and applications
Improving job scheduling performance with parallel access to replicas in Data Grid environment

The Journal of Supercomputing
Dynamic QoS-aware data replication in grid environments based on data "importance"

Future Generation Computer Systems
PDDRA: A new pre-fetching based dynamic data replication algorithm in data grids

Future Generation Computer Systems
Network coding for creating replica in grid environments

HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Efficient Management of Fragmented Replica in Data Grids

International Journal of Grid and High Performance Computing
Decreasing power consumption with energy efficient data aware strategies

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data Grids support data-intensive applications in wide area Grid systems. They utilize local storage systems as distributed data stores by replicating datasets. Replication is a commonly used technique in a distributed environment. The motivation of replication is that replication can improve data availability, data access performance, and load balancing. Usually a complete file is copied to many Grid sites for local access. However, a site may only need parts of a replica. Therefore, to use the storage systems efficiently, it is necessary for a Grid site to store only parts of a replica. In this paper, we propose a concept called fragmented replicas. That is, when doing replication, a site can store only some partial contents needed locally. It can greatly save the storage space wasted in storing unused data. We also propose a block mapping procedure to determine the distribution of blocks in every available server for later replica retrieval. According to this procedure, a server can provide its available partial replica contents for other members in the Grid system to access. On the other hand, a client can retrieve a fragmented replica directly by using the block mapping procedure. After the block mapping procedure, some co-allocation schemes can be used to retrieve data sets from the available servers. The simulation shows that the co-allocation schemes also improve download performance in a fragmented replication system.