Are p2p data-dissemination techniques viable in today's data-intensive scientific collaborations?

  • Authors:
  • Samer Al-Kiswany;Matei Ripeanu;Adriana Iamnitchi;Sudharshan Vazhkudai

  • Affiliations:
  • University of British Columbia;University of British Columbia;University of South Florida;Oak Ridge National Laboratory

  • Venue:
  • Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The interest among a geographically distributed user base to mine massive collections of scientific data propels the need for efficient data dissemination solutions. An optimal data distribution scheme will find the delicate and often application-specific balance among conflicting success metrics such as minimizing transfer times, minimizing the impact on the network, and uniformly distributing load among participants. We use simulations to explore the performance of classes of data-distribution techniques, some of which successfully deployed in large peer-to-peer communities, in the context of today's data-centric scientific collaborations. Based on these simulations we derive several recommendations for data distribution in real-world science collaborations.