A two-view cotraining rule induction system for information extraction

  • Authors:
  • Jing Xiao

  • Affiliations:
  • Department of Computer Science, SUN Yat-Sen University, Guangzhou, China

  • Venue:
  • ICIC'06 Proceedings of the 2006 international conference on Intelligent computing: Part II
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Information extraction is becoming an important task due to the vast growth of the online texts. Pattern rule induction is one kind of main methods to do information extraction. Manually constructing pattern rules is tedious and error prone. In this paper, we present GRID_CoTrain, a weakly supervised paradigm by bootstrapping GRID (a supervised rule induction system) with cotraining and active learning. We also utilize external knowledge resource such as WordNet and existing ontology knowledge to optimize the learned pattern rules.