Discovering Patterns from Large and Dynamic Sequential Data

  • Authors:
  • Ke Wang

  • Affiliations:
  • Department of Information Systems and Computer Science, National University of Singapore, Lower Kent Ridge Road, Singapore, 119260. E-mail: wangk@iscs.nus.edu.sg

  • Venue:
  • Journal of Intelligent Information Systems
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most daily and scientific data are sequential in nature. Discoveringimportant patterns from such data can benefit the user and scientist bypredicting coming activities, interpreting recurring phenomena, extractingoutstanding similarities and differences for close attention, compressingdata, and detecting intrusion. We consider the following incrementaldiscovery problem for large and dynamic sequential data. Suppose thatpatterns were previously discovered and materialized. An update is made tothe sequential database. An incremental discovery will take advantage ofdiscovered patterns and compute only the change by accessing the affectedpart of the database and data structures. In addition to patterns, thestatistics and position information of patterns need to be updated to allowfurther analysis and processing on patterns. We present an efficientalgorithm for the incremental discovery problem. The algorithm is applied tosequential data that honors several sequential patterns modeling weatherchanges in Singapore. The algorithm finds what it is supposed to find.Experiments show that for small updates and large databases, the incrementaldiscovery algorithm runs in time independent of the data size.