Mining Sequential Patterns Using Graph Search Techniques

  • Authors:
  • Yin-Fu Huang;Shao-Yuan Lin

  • Affiliations:
  • -;-

  • Venue:
  • COMPSAC '03 Proceedings of the 27th Annual International Conference on Computer Software and Applications
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sequential patterns discovery has emerged as animportant problem in data mining. In this paper, wepropose an effective GST algorithm for mining sequentialpatterns in a large transaction database. Different from theApriori-like algorithms, the GST algorithm can out oforder find large k-sequences (k = 3); i.e., we can findlarge k-sequences not directly through large(k-1)-sequences. This leads to that our algorithm has muchbetter performance than the Apriori-like algorithms.Besides, we also propose the method to find newsequential patterns by scanning only new transactionssince the database was increased. Through severalcomprehensive experiments, the GST algorithm gains asignificant performance improvement over the Apriori-likealgorithms. Also we found as long as the ratio of the itemspurchased in new transactions is not close to 100%,scanning only new transactions is always much better thanscanning the entire database.