Sequential patterns mining on high-dimensional data stream

  • Authors:
  • Qin-Hua Huang

  • Affiliations:
  • Modern Education Technique Center, Shanghai University of Political Science and Law, Shanghai, China

  • Venue:
  • ICIC'12 Proceedings of the 8th international conference on Intelligent Computing Theories and Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sequential pattern mining is an important problem in many data mining applications. When the data comes as stream, its mining becomes difficult. Unlike relational database the stream data size keep growing while the memory size is fixed relatively. So it is unfeasible to store all the past data in many applications. Hence one-time scan algorithm is needed to execute mining on stream data. As well as the increasing application of sensors, data type of image, video are produced in a rapid speed. Those data usually are with high dimensionality. Inspired by this, we present a new method of mining high dimensional stream data in this paper. A model based on forest structure is developed to index the sequence and to find the new patterns. The algorithm is evaluated on a set of large synthetic data stream.