Scheduling data-intensive bags of tasks in P2P grids with bittorrent-enabled data distribution

  • Authors:
  • Cyril Briquet;Xavier Dalem;Sébastien Jodogne;Pierre-Arnoul de Marneffe

  • Affiliations:
  • University of Liège, Liège, Belgium;University of Liège, Liège, Belgium;University of Liège, Liège, Belgium;University of Liège, Liège, Belgium

  • Venue:
  • Proceedings of the second workshop on Use of P2P, GRID and agents for the development of content networks
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Scheduling Data-Intensive Bags of Tasks in P2P Grids leads to transfers of large input data files, which cause delays in completion times. We propose to combine several existing technologies and patterns to perform efficient data-aware scheduling: (1) use of the BitTorrent P2P file sharing protocol to transfer data, (2) data caching on computational Resources, (3) use of a data-aware Resource selection scheduling algorithm similar to Storage Affinity, (4) a new Task selection scheduling algorithm (Temporal Tasks Grouping), based on the temporally grouped scheduling of Tasks sharing input data files. Data replication is also discusse. The proposed approach does not need an overlay network or Predictive Communications Ordering, making our operational implementation of a P2P Grid middleware easily deployable in unstructured P2P networks. Experiments show that performance gains are achieved by combining BitTorrent, caching, Storage Affinity and Temporal Tasks Grouping. This work can be summarized as combining P2P Grid computing and P2P data transfer technologies.