Naive Bayesian Classification of Structured Data

Authors:
Peter A. Flach;Nicolas Lachiche
Affiliations:
Department of Computer Science, University of Bristol, United Kingdom. peter.flach@bristol.ac.uk;LSIIT, Université Louis Pasteur, Strasbourg, France. lachiche@lsiit.u-strasbg.fr
Venue:
Machine Learning
Year:
2004

Citing 22
Cited 20

Flattening and Saturation: Two Representation Changes for Generalization

Machine Learning - Special issue on evaluating and changing representation
Solving the multiple instance problem with axis-parallel rectangles

Artificial Intelligence
On the Optimality of the Simple Bayesian Classifier under Zero-One Loss

Machine Learning - Special issue on learning with probabilistic representations
Confirmation-guided discovery of first-order rules with tertius

Machine Learning
An extended transformation approach to inductive logic programming

ACM Transactions on Computational Logic (TOCL) - Special issue devoted to Robert A. Kowalski
A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems

Machine Learning
Parameter Estimation in Stochastic Logic Programs

Machine Learning
Relational Data Mining

Relational Data Mining
Introduction to Database Systems

Introduction to Database Systems
Propositionalization approaches to relational data mining

Relational Data Mining
Learning probabilistic relational models

Relational Data Mining
Strongly Typed Inductive Concept Learning

ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Maximum Entropy Modeling with Clausal Constraints

ILP '97 Proceedings of the 7th International Workshop on Inductive Logic Programming
Combining Statistical and Relational Methods for Learning in Hypertext Domains

ILP '98 Proceedings of the 8th International Workshop on Inductive Logic Programming
Logic and Learning

Logic and Learning
Kernels and Distances for Structured Data

Machine Learning
PRISM: a language for symbolic-statistical modeling

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
Parameter learning of logic programs for symbolic-statistical modeling

Journal of Artificial Intelligence Research
Probabilistic classification and clustering in relational data

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
1BC2: a true first-order Bayesian classifier

ILP'02 Proceedings of the 12th international conference on Inductive logic programming
RSD: relational subgroup discovery through first-order feature construction

ILP'02 Proceedings of the 12th international conference on Inductive logic programming
Estimating continuous distributions in Bayesian classifiers

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence

Distribution-based aggregation for relational learning with identifier attributes

Machine Learning
A suffix tree approach to anti-spam email filtering

Machine Learning
Spatial associative classification: propositional vs structural approach

Journal of Intelligent Information Systems
Integrating Naïve Bayes and FOIL

The Journal of Machine Learning Research
Classification in Networked Data: A Toolkit and a Univariate Case Study

The Journal of Machine Learning Research
Transductive Learning from Relational Data

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Structure Learning of Probabilistic Relational Models from Incomplete Relational Data

ECML '07 Proceedings of the 18th European conference on Machine Learning
Hierarchical Classifiers for Complex Spatio-temporal Concepts

Transactions on Rough Sets IX
An Inductive Logic Programming Approach to Statistical Relational Learning

Proceedings of the 2005 conference on An Inductive Logic Programming Approach to Statistical Relational Learning
nFOIL: integrating Naïve Bayes and FOIL

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Classification of graphical data made easy

Neurocomputing
Social network classification incorporating link typevalues

ISI'09 Proceedings of the 2009 IEEE international conference on Intelligence and security informatics
Frequent variable sets based clustering for artificial neural networks particle classification

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Probabilistic inductive logic programming

Probabilistic inductive logic programming
Combining heterogeneous classifiers for relational databases

Pattern Recognition
Soccer ball detection by comparing different feature extraction methodologies

Advances in Artificial Intelligence
Simple decision forests for multi-relational classification

Decision Support Systems
Accurate ball detection in soccer images using probabilistic analysis of salient regions

Machine Vision and Applications
On the multi-agent learning neural and Bayesian methods in skin detector and pornography classifier: An automated anti-pornography system

Neurocomputing
Empowering difficult classes with a similarity-based aggregation in multi-class classification problems

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present 1BC and 1BC2, two systems that perform naive Bayesian classification of structured individuals. The approach of 1BC is to project the individuals along first-order features. These features are built from the individual using structural predicates referring to related objects (e.g., atoms within molecules), and properties applying to the individual or one or several of its related objects (e.g., a bond between two atoms). We describe an individual in terms of elementary features consisting of zero or more structural predicates and one property; these features are treated as conditionally independent in the spirit of the naive Bayes assumption. 1BC2 represents an alternative first-order upgrade to the naive Bayesian classifier by considering probability distributions over structured objects (e.g., a molecule as a set of atoms), and estimating those distributions from the probabilities of its elements (which are assumed to be independent). We present a unifying view on both systems in which 1BC works in language space, and 1BC2 works in individual space. We also present a new, efficient recursive algorithm improving upon the original propositionalisation approach of 1BC. Both systems have been implemented in the context of the first-order descriptive learner Tertius, and we investigate the differences between the two systems both in computational terms and on artificially generated data. Finally, we describe a range of experiments on ILP benchmark data sets demonstrating the viability of our approach.