Decoupling Computation and Data Scheduling in Distributed Data-Intensive Applications
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
Grid Computing: Making the Global Infrastructure a Reality
Grid Computing: Making the Global Infrastructure a Reality
The Globus Striped GridFTP Framework and Server
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
The File Mover: an efficient data transfer system for Grid applications
CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
Collaborative Data Distribution with BitTorrent for Computational Desktop Grids
ISPDC '05 Proceedings of the The 4th International Symposium on Parallel and Distributed Computing
Simple Locality-Aware Co-allocation in Peer-to-Peer Supercomputing
CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Scheduling Independent Tasks Sharing Large Data Distributed with BitTorrent
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Approximate policy iteration for closed-loop learning of visual tasks
ECML'06 Proceedings of the 17th European conference on Machine Learning
Exploiting replication and data reuse to efficiently schedule data-intensive applications on grids
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
Reproducible testing of distributed software with middleware virtualization and simulation
PADTAD '08 Proceedings of the 6th workshop on Parallel and distributed systems: testing, analysis, and debugging
A workflow model for heterogeneous computing environments
Future Generation Computer Systems
On the Design of a Performance-Aware Load Balancing Mechanism for P2P Grid Systems
GPC '09 Proceedings of the 4th International Conference on Advances in Grid and Pervasive Computing
P2P file sharing for P2P computing
Multiagent and Grid Systems - Content management and delivery through P2P-based content networks
A taxonomy of peer-to-peer desktop grid paradigms
Cluster Computing
A family of heuristics for agent-based elastic Cloud bag-of-tasks concurrent scheduling
Future Generation Computer Systems
Hi-index | 0.00 |
Scheduling Data-Intensive Bags of Tasks in P2P Grids leads to transfers of large input data files, which cause delays in completion times. We propose to combine several existing technologies and patterns to perform efficient data-aware scheduling: (1) use of the BitTorrent P2P file sharing protocol to transfer data, (2) data caching on computational Resources, (3) use of a data-aware Resource selection scheduling algorithm similar to Storage Affinity, (4) a new Task selection scheduling algorithm (Temporal Tasks Grouping), based on the temporally grouped scheduling of Tasks sharing input data files. Data replication is also discusse. The proposed approach does not need an overlay network or Predictive Communications Ordering, making our operational implementation of a P2P Grid middleware easily deployable in unstructured P2P networks. Experiments show that performance gains are achieved by combining BitTorrent, caching, Storage Affinity and Temporal Tasks Grouping. This work can be summarized as combining P2P Grid computing and P2P data transfer technologies.