A Multistrategy Approach to Relational Knowledge Discovery inDatabases

  • Authors:
  • Katharina Morik;Peter Brockhausen

  • Affiliations:
  • Univ. Dortmund, Computer Science Department, LS VIII, D–44221 Dortmund. E-mail: morik@ls8.informatik.uni-dortmund.de, brockh@ls8.informatik.uni-dortmund.de;Univ. Dortmund, Computer Science Department, LS VIII, D–44221 Dortmund. E-mail: morik@ls8.informatik.uni-dortmund.de, brockh@ls8.informatik.uni-dortmund.de

  • Venue:
  • Machine Learning - Special issue on multistrategy learning
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

When learning from very large databases, the reduction of complexityis extremely important. Two extremes of making knowledge discoveryin databases (KDD) feasible have been put forward. One extreme is tochoose a very simple hypothesis language, thereby being capable ofvery fast learning on real-world databases. The opposite extreme isto select a small data set, thereby being able to learn veryexpressive (first-order logic) hypotheses. A multistrategy approachallows one to include most of these advantages and exclude most ofthe disadvantages. Simpler learning algorithms detect hierarchieswhich are used to structure the hypothesis space for a more complexlearning algorithm. The better structured the hypothesis space is,the better learning can prune away uninteresting or losing hypothesesand the faster it becomes.We have combined inductive logic programming (ILP) directly with arelational database management system. The ILP algorithm iscontrolled in a model-driven way by the user and in a data-driven wayby structures that are induced by three simple learning algorithms.