Design and evaluation of decentralized online clustering

Authors:
Andres Quiroz;Manish Parashar;Nathan Gnanasambandam;Naveen Sharma
Affiliations:
Xerox Research, Webster, NY;Rutgers University;Xerox Research, Webster, NY;Xerox Research, Webster, NY
Venue:
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Year:
2012

Citing 16
Cited 0

Algorithms for clustering data

Algorithms for clustering data
CURE: an efficient clustering algorithm for large databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Automatic subspace clustering of high dimensional data for data mining applications

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Data clustering: a review

ACM Computing Surveys (CSUR)
LOF: identifying density-based local outliers

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Chord: A scalable peer-to-peer lookup service for internet applications

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Chameleon: Hierarchical Clustering Using Dynamic Modeling

Computer
Analysis of the Clustering Properties of the Hilbert Space-Filling Curve

IEEE Transactions on Knowledge and Data Engineering
Flexible Information Discovery in Decentralized Distributed Systems

HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
A method for decentralized clustering in large multi-agent systems

AAMAS '03 Proceedings of the second international joint conference on Autonomous agents and multiagent systems
PENS: an algorithm for density-based clustering in peer-to-peer systems

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Design patterns from biology for distributed computing

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Introduction to Information Retrieval

Introduction to Information Retrieval
Robust clustering analysis for the management of self-monitoring distributed systems

Cluster Computing
Autonomic policy adaptation using decentralized online clustering

Proceedings of the 7th international conference on Autonomic computing
Clustering distributed data streams in peer-to-peer environments

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ensuring the efficient and robust operation of distributed computational infrastructures is critical, given that their scale and overall complexity is growing at an alarming rate and that their management is rapidly exceeding human capability. Clustering analysis can be used to find patterns and trends in system operational data, as well as highlight deviations from these patterns. Such analysis can be essential for verifying the correctness and efficiency of the operation of the system, as well as for discovering specific situations of interest, such as anomalies or faults, that require appropriate management actions. This work analyzes the automated application of clustering for online system management, from the point of view of the suitability of different clustering approaches for the online analysis of system data in a distributed environment, with minimal prior knowledge and within a timeframe that allows the timely interpretation of and response to clustering results. For this purpose, we evaluate DOC (Decentralized Online Clustering), a clustering algorithm designed to support data analysis for autonomic management, and compare it to existing and widely used clustering algorithms. The comparative evaluations will show that DOC achieves a good balance in the trade-offs inherent in the challenges for this type of online management.