Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
FreeSpan: frequent pattern-projected sequential pattern mining
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Sequence mining in categorical domains: incorporating constraints
Proceedings of the ninth international conference on Information and knowledge management
SPADE: an efficient algorithm for mining frequent sequences
Machine Learning
Mining Sequential Patterns with Regular Expression Constraints
IEEE Transactions on Knowledge and Data Engineering
Mining Sequential Patterns: Generalizations and Performance Improvements
EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth
Proceedings of the 17th International Conference on Data Engineering
The PSP Approach for Mining Sequential Patterns
PKDD '98 Proceedings of the Second European Symposium on Principles of Data Mining and Knowledge Discovery
Mining Generalized Association Rules
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Efficient Constraint-Based Sequential Pattern Mining Using Dataset Filtering Techniques
Proceedings of the Baltic Conference, BalticDB&IS 2002 - Volume 1
Sequential PAttern mining using a bitmap representation
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
A new algorithm for gap constrained sequence mining
Proceedings of the 2004 ACM symposium on Applied computing
BIDE: Efficient Mining of Frequent Closed Sequences
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Pre-Processing Time Constraints for Efficiently Mining Generalized Sequential Patterns
TIME '04 Proceedings of the 11th International Symposium on Temporal Representation and Reasoning
Efficient mining of sequential patterns with time constraints by delimited pattern growth
Knowledge and Information Systems
Mining sequences with temporal annotations
Proceedings of the 2006 ACM symposium on Applied computing
Sequential pattern mining algorithm for automotive warranty data
Computers and Industrial Engineering
SO_MAD: SensOr Mining for Anomaly Detection in Railway Data
ICDM '09 Proceedings of the 9th Industrial Conference on Advances in Data Mining. Applications and Theoretical Aspects
Mining closed flexible patterns in time-series databases
Expert Systems with Applications: An International Journal
Mining Web navigation patterns with a path traversal graph
Expert Systems with Applications: An International Journal
Extracting temporal patterns from interval-based sequences
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Projection-based partial periodic pattern mining for event sequences
Expert Systems with Applications: An International Journal
Closeness Preference - A new interestingness measure for sequential rules mining
Knowledge-Based Systems
A data mining approach to discovering reliable sequential patterns
Journal of Systems and Software
Hi-index | 12.06 |
In this paper we consider the problem of discovering sequential patterns by handling time constraints as defined in the Gsp algorithm. While sequential patterns could be seen as temporal relationships between facts embedded in the database where considered facts are merely characteristics of individuals or observations of individual behavior, generalized sequential patterns aim to provide the end user with a more flexible handling of the transactions embedded in the database. We thus propose a new efficient algorithm, called Gtc (Graph for Time Constraints) for mining such patterns in very large databases. It is based on the idea that handling time constraints in the earlier stage of the data mining process can be highly beneficial. One of the most significant new feature of our approach is that handling of time constraints can be easily taken into account in traditional levelwise approaches since it is carried out prior to and separately from the counting step of a data sequence. Our test shows that the proposed algorithm performs significantly faster than a state-of-the-art sequence mining algorithm.