Simple decision forests for multi-relational classification

  • Authors:
  • Bahareh Bina;Oliver Schulte;Branden Crawford;Zhensong Qian;Yi Xiong

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • Decision Support Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

An important task in multi-relational data mining is link-based classification which takes advantage of attributes of links and linked entities, to predict the class label. The relational Naive Bayes classifier exploits independence assumptions to achieve scalability. We introduce a weaker independence assumption to the effect that information from different data tables is independent given the class label. The independence assumption entails a closed-form formula for combining probabilistic predictions based on decision trees learned on different database tables. Logistic regression learns different weights for information from different tables and prunes irrelevant tables. In experiments, learning was very fast with competitive accuracy.