On correlated availability in Internet-distributed systems

  • Authors:
  • D. Kondo;A. Andrzejak;D. P. Anderson

  • Affiliations:
  • INRIA, Le Chesnay;Zuse Inst. Berlin (ZIB), Berlin;UC Berkeley, Berkeley, CA

  • Venue:
  • GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

As computer networks rapidly increase in size and speed, Internet-distributed systems such as P2P, volunteer computing, and Grid systems are increasingly common. A precise and accurate characterization of Internet resources is important for the design and evaluation of such Internet-distributed systems, yet our picture of the Internet landscape is not perfectly clear. To improve this picture, we measure and characterize the time dynamics of availability in a large-scale Internet-distributed system with over 110,000 hosts. Our characterization focuses on identifying patterns of correlated availability. We determine scalable and accurate clustering techniques and distance metrics for automatically detecting significant availability patterns. By means of clustering, we identify groups of resources with correlated availability that exhibit similar time effects. Then we show how these correlated clusters of resources can be used to improve resource management for parallel applications in the context of volunteer computing.