Mining sequential patterns across time sequences

  • Authors:
  • Gong Chen;Xindong Wu;Xingquan Zhu

  • Affiliations:
  • Department of Statistics, University of California, Los Angeles, CA;Department of Computer Science, University of Vermont, Burlington, VT;Department of Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL

  • Venue:
  • New Generation Computing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we deal with mining sequential patterns in multiple time sequences. Building on a state-of-the-art sequential pattern mining algorithm PrefixSpan for mining transaction databases, we propose MILE (MIning in muLtiple sEquences), an efficient algorithm to facilitate the mining process. MILE recursively utilizes the knowledge of existing patterns to avoid redundant data scanning, and therefore can effectively speed up the new patterns' discovery process. Another unique feature of MILE is that it can incorporate prior knowledge of the data distribution in time sequences into the mining process to further improve the performance. Extensive empirical results show that MILE is significantly faster than PrefixSpan. As MILE consumes more memory than PrefixSpan, we also present a solution to trade time efficiency in memory constrained environments.