Mining closed patterns in relational, graph and network data

Authors:
Gemma C. Garriga;Roni Khardon;Luc De Raedt
Affiliations:
INRIA Lille Nord Europe, Lille, France;Department of Computer Science, Tufts University, Medford, USA;Department of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium
Venue:
Annals of Mathematics and Artificial Intelligence
Year:
2013

Citing 29
Cited 0

Foundations of logic programming; (2nd extended ed.)

Foundations of logic programming; (2nd extended ed.)
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Clausal Discovery

Machine Learning - special issue on inductive logic programming
Fast discovery of association rules

Advances in knowledge discovery and data mining
Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Learning logic programs with structured background knowledge

Artificial Intelligence
Foundations of Inductive Logic Programming

Foundations of Inductive Logic Programming
Formal Concept Analysis: Mathematical Foundations

Formal Concept Analysis: Mathematical Foundations
Discovery of relational association rules

Relational Data Mining
Levelwise Search and Borders of Theories in KnowledgeDiscovery

Data Mining and Knowledge Discovery
Feature Construction with Version Spaces for Biochemical Applications

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Learning of Simple Conceptual Graphs from Positive and Negative Examples

PKDD '99 Proceedings of the Third European Conference on Principles of Data Mining and Knowledge Discovery
Mining Minimal Non-redundant Association Rules Using Frequent Closed Itemsets

CL '00 Proceedings of the First International Conference on Computational Logic
Application of Different Learning Methods to Hungarian Part-of-Speech Tagging

ILP '99 Proceedings of the 9th International Workshop on Inductive Logic Programming
Discovering Associations between Spatial Objects: An ILP Application

ILP '01 Proceedings of the 11th International Conference on Inductive Logic Programming
Discovering all most specific sentences

ACM Transactions on Database Systems (TODS)
gSpan: Graph-Based Substructure Pattern Mining

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
On Maximal Frequent and Minimal Infrequent Sets in Binary Matrices

Annals of Mathematics and Artificial Intelligence
Frequent Sub-Structure-Based Approaches for Classifying Chemical Compounds

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
CloseGraph: mining closed frequent graph patterns

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Advances in frequent itemset mining implementations: report on FIMI'03

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Mining Non-Redundant Association Rules

Data Mining and Knowledge Discovery
Horn axiomatizations for sequential data

Theoretical Computer Science
A machine learning approach to building domain-specific search engines

IJCAI'99 Proceedings of the 16th international joint conference on Artificial intelligence - Volume 2
On mining closed sets in multi-relational data

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
What is frequent in a single graph?

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Effective feature construction by maximum common subgraph sampling

Machine Learning
An output-polynomial time algorithm for mining frequent closed attribute trees

ILP'05 Proceedings of the 15th international conference on Inductive Logic Programming
Learning closed sets of labeled graphs for chemical applications

ILP'05 Proceedings of the 15th international conference on Inductive Logic Programming

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent theoretical insights have led to the introduction of efficient algorithms for mining closed item-sets. This paper investigates potential generalizations of this paradigm to mine closed patterns in relational, graph and network databases. Several semantics and associated definitions for closed patterns in relational data have been introduced in previous work, but the differences among these and the implications of the choice of semantics was not clear. The paper investigates these implications in the context of generalizing the LCM algorithm, an algorithm for enumerating closed item-sets. LCM is attractive since its run time is linear in the number of closed patterns and since it does not need to store the patterns output in order to avoid duplicates, further reducing memory signature and run time. Our investigation shows that the choice of semantics has a dramatic effect on the properties of closed patterns and as a result, in some settings a generalization of the LCM algorithm is not possible. On the other hand, we provide a full generalization of LCM for the semantic setting that has been previously used by the Claudien system.