Pattern-based time-series subsequence clustering using radial distribution functions

  • Authors:
  • Anne M. Denton;Christopher A. Besemann;Dietmar H. Dorr

  • Affiliations:
  • North Dakota State University, Department of Computer Science and Operations Research, 58105-5164, Fargo, ND, USA;North Dakota State University, Department of Computer Science and Operations Research, 58105-5164, Fargo, ND, USA;North Dakota State University, Department of Computer Science and Operations Research, 58105-5164, Fargo, ND, USA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering of time series subsequence data commonly produces results that are unspecific to the data set. This paper introduces a clustering algorithm, that creates clusters exclusively from those subsequences that occur more frequently in a data set than would be expected by random chance. As such, it partially adopts a pattern mining perspective into clustering. When subsequences are being labeled based on such clusters, they may remain without label. In fact, if the clustering was done on an unrelated time series it is expected that the subsequences should not receive a label. We show that pattern-based clusters are indeed specific to the data set for 7 out of 10 real-world sets we tested, and for window-lengths up to 128 time points. While kernel-density-based clustering can be used to find clusters with similar properties for window sizes of 8–16 time points, its performance degrades fast for increasing window sizes.