On correlated availability in Internet-distributed systems

Authors:
D. Kondo; A. Andrzejak; D. P. Anderson
Affiliations:
INRIA, Le Chesnay; Zuse Inst. Berlin (ZIB), Berlin; UC Berkeley, Berkeley, CA
Venue:
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
Year:
2008

Abstract

As computer networks rapidly increase in size and speed, Internet-distributed systems such as P2P, volunteer computing, and Grid systems are increasingly common. A precise and accurate characterization of Internet resources is important for the design and evaluation of such Internet-distributed systems, yet our picture of the Internet landscape is not perfectly clear. To improve this picture, we measure and characterize the time dynamics of availability in a large-scale Internet-distributed system with over 110,000 hosts. Our characterization focuses on identifying patterns of correlated availability. We determine scalable and accurate clustering techniques and distance metrics for automatically detecting significant availability patterns. By means of clustering, we identify groups of resources with correlated availability that exhibit similar time effects. Then we show how these correlated clusters of resources can be used to improve resource management for parallel applications in the context of volunteer computing.
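
The idea of grouping hosts by correlated availability can be illustrated with a small sketch. The abstract does not name the clustering technique or distance metric the authors settled on, so everything below is an assumption made for illustration: each host is summarized as a hypothetical 24-element vector (fraction of time available per hour of day), the distance is plain Euclidean, and the grouping uses a basic k-means with deterministic farthest-point initialization. The helper names and the toy data are invented here and do not come from the paper.

```python
from math import sqrt


def euclidean(a, b):
    """Euclidean distance between two equal-length availability vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def farthest_point_init(points, k):
    """Deterministic seeding: start with the first point, then repeatedly
    add the point farthest from all centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(euclidean(p, c) for c in centroids))
        centroids.append(nxt)
    return centroids


def kmeans(points, k, iterations=20):
    """Basic k-means; returns (labels, centroids)."""
    centroids = [list(c) for c in farthest_point_init(points, k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each host goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: euclidean(p, centroids[j]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = mean_vector(members)
    return labels, centroids


# Hypothetical toy traces (assumption, not data from the paper):
# hosts that are mostly available during office hours vs. overnight.
def daytime_host(level):
    return [level if 9 <= h < 17 else 0.05 for h in range(24)]


def nighttime_host(level):
    return [level if (h >= 20 or h < 6) else 0.05 for h in range(24)]


hosts = [
    daytime_host(0.9),
    daytime_host(0.8),
    nighttime_host(0.9),
    nighttime_host(0.85),
    daytime_host(0.95),
]

labels, centroids = kmeans(hosts, k=2)
print(labels)
```

On this toy input the three daytime hosts end up in one cluster and the two nighttime hosts in the other. A scheduler for a parallel application could then prefer hosts from a single cluster so that they tend to be available at the same times. A real trace of more than 110,000 hosts would likely call for a more scalable method and a carefully chosen metric than this sketch uses.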