Clustering Data Streams in Optimization and Geography Domains

  • Authors:
  • Ling-Yin Wei;Wen-Chih Peng

  • Affiliations:
  • Institute of Computer Science and Engineering, National Chiao Tung University, Hsinchu, Taiwan, ROC;Institute of Computer Science and Engineering, National Chiao Tung University, Hsinchu, Taiwan, ROC

  • Venue:
  • PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we formulate a dual clustering problem in spatial data streams. A spatial data stream consists of data points with attributes in the optimization and geography domains. We aim at partitioning these objects into disjoint clusters such that at each time window (1) objects in the same cluster satisfy the transitively r-connected relation in the optimization and geography domains, and (2) the number of clusters is as minimal as possible. We propose a Hierarchical-Based Clustering algorithm (HBC). Specifically, objects are represented as a graph structure, called RGraph, where each node represents an object and edges indicate their similarity relationships. In light of RGraph, algorithm HBC iteratively merges clusters. Experimental results show the performance of the algorithm.