Unsupervised learning of pattern templates from unannotated corpora for proper noun extraction

  • Authors:
  • Seung-Shik Kang;Chong-Woo Woo

  • Affiliations:
  • School of Computer Science, Kookmin University & AITrc, Seoul, Korea;School of Computer Science, Kookmin University & AITrc, Seoul, Korea

  • Venue:
  • RSFDGrC'03 Proceedings of the 9th international conference on Rough sets, fuzzy sets, data mining, and granular computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes an approach to extracting proper nouns in the very large text corpora without using the lexicon or cue word dictionary. At first, we train the pattern for extracting the proper nouns by applying the initial proper names into the unannotated corpora that does not have any tags yet. And then we continuously apply the pattern templates into the corpora in order to extract new proper nouns until certain period.