Database dependency discovery: a machine learning approach

Authors:
Peter A. Flach;Iztok Savnik
Affiliations:
Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK E-mail:Peter.Flach@bristol.ac.uk;Faculty of Computer and Information Science, University of Ljubljana, 1000 Ljubljana, Slovenia E-mail: Iztok.Savnik@fri.uni:lj.si
Venue:
AI Communications
Year:
1999

Citing 15
Cited 28

Subsumption and implication

Information Processing Letters
Principles of database and knowledge-base systems, Vol. I

Principles of database and knowledge-base systems, Vol. I
The design of relational databases

The design of relational databases
Algorithms for inferring functional dependencies from relations

Data & Knowledge Engineering
Approximate inference of functional dependencies from relations

ICDT '92 Selected papers of the fourth international conference on Database theory
Logical settings for concept-learning

Artificial Intelligence
On the menbership problem for functional and multivalued dependencies in relational databases

ACM Transactions on Database Systems (TODS)
Horn clauses and database dependencies

Journal of the ACM (JACM)
Logic and Databases: A Deductive Approach

ACM Computing Surveys (CSUR)
Advances in Inductive Logic Programming

Advances in Inductive Logic Programming
Predicate Invention in Inductive Data Engineering

ECML '93 Proceedings of the European Conference on Machine Learning
Dependency Inference

VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Normal Forms for Inductive Logic Programming

ILP '97 Proceedings of the 7th International Workshop on Inductive Logic Programming
Discovery of multivalued dependencies from relations

Discovery of multivalued dependencies from relations
Theory of Relational Databases

Theory of Relational Databases

Internet resources on ILP for KDD

Relational Data Mining
Confirmation-Guided Discovery of First-Order Rules with Tertius

Machine Learning
Learning in Clausal Logic: A Perspective on Inductive Logic Programming

Computational Logic: Logic Programming and Beyond, Essays in Honour of Robert A. Kowalski, Part I
FastFDs: A Heuristic-Driven, Depth-First Algorithm for Mining Functional Dependencies from Relation Instances - Extended Abstract

DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
Rule Evaluation Measures: A Unifying View

ILP '99 Proceedings of the 9th International Workshop on Inductive Logic Programming
Subgroup Discovery with CN2-SD

The Journal of Machine Learning Research
SQL-based discovery of exact and approximate functional dependencies

Working group reports from ITiCSE on Innovation and technology in computer science education
Approximate matching of textual domain attributes for information source integration

Proceedings of the 2nd international workshop on Information quality in information systems
Discovering functional dependencies from similarity-based fuzzy relational databases

Intelligent Data Analysis
Discovery of multivalued dependencies from relations

Intelligent Data Analysis
Non-deterministic ideal operators: An adequate tool for formalization in Data Bases

Discrete Applied Mathematics
Using association rules to mine for strong approximate dependencies

Data Mining and Knowledge Discovery
Discovering branching and fractional dependencies in databases

Data & Knowledge Engineering
AD-Miner: A new incremental method for discovery of minimal approximate dependencies using logical operations

Intelligent Data Analysis
Characteristic relational patterns

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Discovering functional dependencies for multidimensional design

Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Design-level metrics estimation based on code metrics

Proceedings of the 2010 ACM Symposium on Applied Computing
iZi: a new toolkit for pattern mining problems

ISMIS'08 Proceedings of the 17th international conference on Foundations of intelligent systems
Using ontologies to discover fact IDs

DOLAP '10 Proceedings of the ACM 13th international workshop on Data warehousing and OLAP
The iZi project: easy prototyping of interesting pattern mining algorithms

PAKDD'09 Proceedings of the 13th Pacific-Asia international conference on Knowledge discovery and data mining: new frontiers in applied data mining
Differential dependencies: Reasoning and discovery

ACM Transactions on Database Systems (TODS)
Advancing the discovery of unique column combinations

Proceedings of the 20th ACM international conference on Information and knowledge management
Characterization and armstrong relations for degenerate multivalued dependencies using formal concept analysis

ICFCA'05 Proceedings of the Third international conference on Formal Concept Analysis
Learning schema mappings

Proceedings of the 15th International Conference on Database Theory
Comparable dependencies over heterogeneous data

The VLDB Journal — The International Journal on Very Large Data Bases
Letting keys and functional dependencies out of the bag

APCCM '13 Proceedings of the Ninth Asia-Pacific Conference on Conceptual Modelling - Volume 143
Editorial: Efficient discovery of similarity constraints for matching dependencies

Data & Knowledge Engineering
Learning schema mappings

ACM Transactions on Database Systems (TODS) - Invited papers issue

Quantified Score

Hi-index	0.00

Visualization

Abstract

Database dependencies, such as functional and multivalueddependencies, express the presence of structure in databaserelations, that can be utilised in the database design process. Thediscovery of database dependencies can be viewed as an inductionproblem, in which general rules (dependencies) are obtained fromspecific facts (the relation). This viewpoint has the advantage ofabstracting away as much as possible from the particulars of thedependencies. The algorithms in this paper are designed such thatthey can easily be generalised to other kinds of dependencies.Likein current approaches to computational induction such as inductivelogic programming, we distinguish between top-down algorithms andbottom-up algorithms. In a top-down approach, hypotheses aregenerated in a systematic way and then tested against the givenrelation. In a bottom-up approach, the relation is inspected inorder to see what dependencies it may satisfy or violate. We give asimple (but inefficient) top-down algorithm, a bi-directionalalgorithm, and a bottom-up algorithm. In the case of functionaldependencies, these algorithms have been implemented in the FDEPsystem and evaluated experimentally. The bottom-up algorithm is themost efficient of the three, and also outperforms other algorithmsfrom the literature.