Knowledge Discovery from Structured Data by Beam-Wise Graph-Based Induction

Authors:
Takashi Matsuda;Hiroshi Motoda;Tetsuya Yoshida;Takashi Washio
Affiliations:
-;-;-;-
Venue:
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Year:
2002

Citing 8
Cited 7

Learning flexible concepts: fundamental ideas and a method based on two-tiered representation

Machine learning
C4.5: programs for machine learning

C4.5: programs for machine learning
CLIP: concept learning from inference patterns

Artificial Intelligence - Special issue: AI research in Japan
Graph-Based Data Mining

IEEE Intelligent Systems
The CN2 Induction Algorithm

Machine Learning
Induction of Decision Trees

Machine Learning
An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data

PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Extension of Graph-Based Induction for General Graph Structured Data

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications

Extracting frequent connected subgraphs from large graph sets

Journal of Computer Science and Technology
Constructing a Decision Tree for Graph-Structured Data and its Applications

Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences
An efficient algorithm of frequent connected subgraph extraction

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Classifier construction by graph-based induction for graph-structured data

PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Analysis of hepatitis dataset by decision tree based on graph-based induction

JSAI'03/JSAI04 Proceedings of the 2003 and 2004 international conference on New frontiers in artificial intelligence
Extracting diagnostic knowledge from hepatitis dataset by decision tree graph-based induction

AM'03 Proceedings of the Second international conference on Active Mining
Constructing a Decision Tree for Graph-Structured Data and its Applications

Fundamenta Informaticae - Advances in Mining Graphs, Trees and Sequences

Quantified Score

Hi-index	0.00

Visualization

Abstract

A machine learning technique called Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). Because of its greedy search strategy, it is very efficient but suffers from incompleteness of search. We improved its search capability without imposing much computational complexity by incorporating the idea of beam search. Additional improvement is made to extract patterns that are more discriminative than those simply occurring frequently, and to enumerate identical patterns accurately based on the notion of canonical labeling. This new algorithm was implemented (now called Beam-wise GBI, B-GBI for short) and tested against a DNA data set from UCI repository. Since DNA data is a sequence of symbols, representing each sequence by attribute-value pairs by simply assigning these symbols to the values of ordered attributes does not make sense. By transforming the sequence into a graph structure and running B-GBI it is possible to extract discriminative substructures. These can be new attributes for a classification problem. Effect of beam width on the number of discovered attributes and predictive accuracy was evaluated, together with extracted characteristic subsequences, and the results indicate the effectiveness of B-GBI.