Comparison of graph-based and logic-based multi-relational data mining

  • Authors:
  • Nikhil S. Ketkar;Lawrence B. Holder;Diane J. Cook

  • Affiliations:
  • University of Texas at Arlington;University of Texas at Arlington;University of Texas at Arlington

  • Venue:
  • ACM SIGKDD Explorations Newsletter
  • Year:
  • 2005

Quantified Score

Hi-index 0.01

Visualization

Abstract

We perform an experimental comparison of the graph-based multi-relational data mining system, Subdue, and the inductive logic programming system, CProgol, on the Mutagenesis dataset and various artificially generated Bongard problems. Experimental results indicate that Subdue can significantly outperform CProgol while discovering structurally large multi-relational concepts. It is also observed that CProgol is better at learning semantically complicated concepts and it tends to use background knowledge more effectively than Subdue. An analysis of the results indicates that the differences in the performance of the systems are a result of the difference in the expressiveness of the logic-based and the graph-based representations. The ability of graph-based systems to learn structurally large concepts comes from the use of a weaker representation whose expressiveness is intermediate between propositional and first-order logic. The use of this weaker representation is advantageous while learning structurally large concepts but it limits the learning of semantically complicated concepts and the utilization background knowledge.