Knowledge discovery with second-order relations

Authors:
Rattikorn Hewett;John Leuchner
Affiliations:
Institute for Human and Machine Cognition, University of West Florida, Pensacola FL;Institute for Human and Machine Cognition, University of West Florida, Pensacola FL
Venue:
Knowledge and Information Systems
Year:
2002

Citing 33
Cited 2

Prime Implicants, Minimum Covers, and the Complexity of Logic Simplification

IEEE Transactions on Computers
Machine learning an artificial intelligence approach volume II

Machine learning an artificial intelligence approach volume II
Principles of database and knowledge-base systems, Vol. I

Principles of database and knowledge-base systems, Vol. I
The structure of the relational database model

The structure of the relational database model
Learning nonrecursive definitions of relations with LINUS

EWSL-91 Proceedings of the European working session on learning on Machine learning
Absolute Minimization of Completely Specified Switching Functions

IEEE Transactions on Computers
The Utility of Knowledge in Inductive Learning

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
Very Simple Classification Rules Perform Well on Most Commonly Used Datasets

Machine Learning
Logic synthesis

Logic synthesis
Two-level logic minimization: an overview

Integration, the VLSI Journal
Generalizing Version Spaces

Machine Learning
Mining quantitative association rules in large relational tables

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
Inductive logic programming and knowledge discovery in databases

Advances in knowledge discovery and data mining
Predicting equity returns from securities data

Advances in knowledge discovery and data mining
From data mining to knowledge discovery: current challenges and future directions

Advances in knowledge discovery and data mining
Extending the database relational model to capture more meaning

ACM Transactions on Database Systems (TODS)
A relational model of data for large shared data banks

Communications of the ACM
Stochastic Complexity in Statistical Inquiry Theory

Stochastic Complexity in Statistical Inquiry Theory
Machine Learning

Machine Learning
Inductive Logic Programming: Techniques and Applications

Inductive Logic Programming: Techniques and Applications
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
Knowledge Discovery in Databases

Knowledge Discovery in Databases
The Role of Occam‘s Razor in Knowledge Discovery

Data Mining and Knowledge Discovery
Learning Logical Definitions from Relations

Machine Learning
The CN2 Induction Algorithm

Machine Learning
Induction of Decision Trees

Machine Learning
The Power of Decision Tables

ECML '95 Proceedings of the 8th European Conference on Machine Learning
SLIQ: A Fast Scalable Classifier for Data Mining

EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Version spaces: an approach to concept learning.

Version spaces: an approach to concept learning.
MINI: a heuristic approach for logic minimization

IBM Journal of Research and Development

Restructuring decision tables for elucidation of knowledge

Data & Knowledge Engineering
Inference of abduction theories for handling incompleteness in first-order learning

Knowledge and Information Systems - Special Issue on Mining Low-Quality Data

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an induction technique that discovers a set of classification rules, from a set of examples, using second-order relations as a representational model. Second-order relations are database relations in which tuples have sets of atomic values as components. Using sets of values, which are interpreted as disjunctions, provides compact representations that facilitate efficient management and enhance comprehensibility. The second-order relational framework is based on theoretical foundations that link relational database theory, machine learning, and logic synthesis. The rule induction technique can be viewed as a second-order relation compression problem in which the original relation, representing training data, is transformed into a second-order relation with fewer tuples by merging tuples in ways that preserve consistency with the training data. This problem is closely related to two-level Boolean function minimization in logic synthesis. We describe a rule-mining system, SORCER, and compare its performance to two state-of-the-art classification systems: C4.5 and CBA. Experimental results based on the average of error rates ove 26 data sets show that SORCER, using a simple compression scheme, outperforms C4.5 and is competitive to CBA. Using a slightly more sophisticated compression scheme, SORCER outperforms both C4.5 and CBA.