Future Generation Computer Systems - Special issue on metacomputing
The MicroGrid: a scientific tool for modeling computational gridsr
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
A case for using MPI's derived datatypes to improve I/O performance
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Proceedings of the 2001 ACM/IEEE conference on Supercomputing
Data management and transfer in high-performance computational grid environments
Parallel Computing - Parallel data-intensive algorithms and applications
Predicting the Performance of Wide Area Data Transfers
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Replica Selection in the Globus Data Grid
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Decoupling Computation and Data Scheduling in Distributed Data-Intensive Applications
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
Downloading Replicated, Wide-Area Files - A Framework and Empirical Evaluation
NCA '04 Proceedings of the Network Computing and Applications, Third IEEE International Symposium
GrADSolve: a grid-based RPC system for parallel computing with application-level scheduling
Journal of Parallel and Distributed Computing - Special issue on middleware
Performance and Scalability of a Replica Location Service
HPDC '04 Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing
Realistic Large-Scale Online Network Simulation
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
A Peer-to-Peer Replica Location Service Based on a Distributed Hash Table
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
The Livny and Plank-Beck Problems: Studies in Data Movement on the Computational Grid
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Numerical Libraries and the Grid
International Journal of High Performance Computing Applications
The impact of data replication on job scheduling performance in the Data Grid
Future Generation Computer Systems
A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O
CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2 - Volume 02
Complete and fragmented replica selection and retrieval in Data Grids
Future Generation Computer Systems
Intelligent data staging with overlapped execution of grid applications
Future Generation Computer Systems
GRID '06 Proceedings of the 7th IEEE/ACM International Conference on Grid Computing
A Recursively-Adjusting Co-allocation scheme with a Cyber-Transformer in Data Grids
Future Generation Computer Systems
A data locality aware online scheduling approach for I/O-intensive jobs with file sharing
JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
Centralized versus distributed schedulers for multiple bag-of-task applications
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Scheduling file transfers for data-intensive jobs on heterogeneous clusters
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Memory-Mapped File Approach for On-Demand Data Co-allocation on Grids
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Accessing data from many servers simultaneously and adaptively in data grids
Future Generation Computer Systems
An integrated security-aware job scheduling strategy for large-scale computational grids
Future Generation Computer Systems
A DSM-based fragmented data sharing framework for grids
Future Generation Computer Systems
Transparent on-demand co-allocation data access for grids
International Journal of Ad Hoc and Ubiquitous Computing
Improving job scheduling performance with parallel access to replicas in Data Grid environment
The Journal of Supercomputing
Bacteria foraging optimization for protein sequence analysis on the grid
Future Generation Computer Systems
International Journal of Grid and High Performance Computing
Hi-index | 0.00 |
Grids are being used for executing parallel applications over remote resources. For executing a parallel application on a set of grid resources chosen by a user or a grid scheduler, the input data needed by the application is segmented according to the data distribution followed in the application and the data segments are distributed to the grid resources. The same input data may be used subsequently by different applications leading to multiple copies (replicas) of parallel data segments in various grid resources. The data needed for a parallel application can be gathered from the existing replicas onto the computational resources chosen by the grid scheduler for application execution. In this work, we have devised novel algorithms for determining ''nearest'' replica sites containing data segments needed by a parallel application executing on a set of resources with the objective of minimizing the time needed for transferring the data segments from the replica sites to the resources. We have tested our algorithms on different kinds of experimental setups. We find that the best algorithm varies according to the configuration of data servers and clients. In all cases, our algorithms performed better than the existing algorithms by at least 15%.