Simple Estimators for Relational Bayesian Classifiers

  • Authors:
  • Jennifer Neville;David Jensen;Brian Gallagher

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present the Relational BayesianClassifier (RBC), a modification of the Simple BayesianClassifier (SBC) for relational data. There exist severalBayesian classifiers that learn predictive models ofrelational data, but each uses a different estimationtechnique for modeling heterogeneous sets of attributevalues. The effects of data characteristics on estimationhave not been explored. We consider four simpleestimation techniques and evaluate them on three real-worlddata sets. The estimator that assumes each multisetvalue is independently drawn from the same distribution(INDEPVAL) achieves the best empirical results. Weexamine bias and variance tradeoffs over a range of datasets and show that INDEPVAL's ability to model moremultiset information results in lower bias estimates andcontributes to its superior performance.