Graph-Based Induction for General Graph Structured Data and Its Application to Chemical Compound Data

  • Authors:
  • Takashi Matsuda;Tadashi Horiuchi;Hiroshi Motada;Takashi Washio

  • Affiliations:
  • -;-;-;-

  • Venue:
  • DS '00 Proceedings of the Third International Conference on Discovery Science
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most of the relations are represented by a graph structure, e.g., chemical bonding, Web browsing record, DNA sequence, Inference pattern (program trace), to name a few. Thus, efficiently finding characteristic substructures in a graph will be a useful technique in many important KDD/ML applications. However, graph pattern matching is a hard problem. We propose a machine learning technique called Graph-Based Induction (GBI) that efficiently extracts typical patterns from graph data in an approximate manner by stepwise pair expansion (pairwise chunking). It can handle general graph structured data, i.e., directed/ undirected, colored/uncolored graphs with/without (self) loop and with colored/uncolored links. We show that its time complexity is almost linear with the size of graph. We, further, show that GBI can effectively be applied to the extraction of typical patterns from chemical compound data from which to generate classification rules, and that GBI also works as a feature construction component for other machine learning tools.