An incremental mining algorithm for maintaining sequential patterns using pre-large sequences

Authors:
Tzung-Pei Hong;Ching-Yao Wang;Shian-Shyong Tseng
Affiliations:
Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung 811, Taiwan and Department of Computer Science and Engineering, National Sun Yat-sen Univers ...;Information and Communications Research Laboratories, Industrial Technology Research Institute, Hsinchu, Taiwan;Institute of Computer and Information Science, National Chiao-Tung University, Hsinchu 300, Taiwan
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 12
Cited 4

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Mining optimized association rules for numeric attributes

PODS '96 Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Data Mining: An Overview from a Database Perspective

IEEE Transactions on Knowledge and Data Engineering
Using a Hash-Based Method with Transaction Trimming for Mining Association Rules

IEEE Transactions on Knowledge and Data Engineering
Database Mining: A Performance Perspective

IEEE Transactions on Knowledge and Data Engineering
Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Discovery of Multiple-Level Association Rules from Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
An Efficient Algorithm for Mining Association Rules in Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
A General Incremental Technique for Maintaining Discovered Association Rules

Proceedings of the Fifth International Conference on Database Systems for Advanced Applications (DASFAA)
An Adaptive Algorithm for Incremental Mining of Association Rules

DEXA '98 Proceedings of the 9th International Workshop on Database and Expert Systems Applications
A new incremental data mining algorithm using pre-large itemsets

Intelligent Data Analysis

Classification based on association rules: A lattice-based approach

Expert Systems with Applications: An International Journal
CAR-Miner: An efficient algorithm for mining class-association rules

Expert Systems with Applications: An International Journal
CSSF-trie structure to mine constraint sequential patterns from progressive database

International Journal of Knowledge Engineering and Data Mining
Incremental mining of sequential patterns: Progress and challenges

Intelligent Data Analysis

Quantified Score

Hi-index	12.05

Visualization

Abstract

Mining useful information and helpful knowledge from large databases has evolved into an important research area in recent years. Among the classes of knowledge derived, finding sequential patterns in temporal transaction databases is very important since it can help model customer behavior. In the past, researchers usually assumed databases were static to simplify data-mining problems. In real-world applications, new transactions may be added into databases frequently. Designing an efficient and effective mining algorithm that can maintain sequential patterns as a database grows is thus important. In this paper, we propose a novel incremental mining algorithm for maintaining sequential patterns based on the concept of pre-large sequences to reduce the need for rescanning original databases. Pre-large sequences are defined by a lower support threshold and an upper support threshold that act as gaps to avoid the movements of sequences directly from large to small and vice versa. The proposed algorithm does not require rescanning original databases until the accumulative amount of newly added customer sequences exceeds a safety bound, which depends on database size. Thus, as databases grow larger, the numbers of new transactions allowed before database rescanning is required also grow. The proposed approach thus becomes increasingly efficient as databases grow.