Semiautomatic extension of CoreNet using a bootstrapping mechanism on corpus-based co-occurrences

  • Authors:
  • Chris Biemann;Sa-Im Shin;Key-Sun Choi

  • Affiliations:
  • University of Leipzig, Augustusplatz, Leipzig, Germany;KORTERM, Division of Computer Science, Kusung Yusong, Daejon, Korea;KORTERM, Division of Computer Science, Kusung Yusong, Daejon, Korea

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper describes a language-independent approach for semiautomatic extension of lexical-semantic word nets and evaluates the method on CoreNet, the Korean version of word net. In a bootstrapping fashion, the so-called 'Pendulum Algorithm' operates on word sets obtained by co-occurrence statistics on a large un-annotated corpus and keeps error propagation low by a verification step. Results are not sufficient for automatic extension, but provide a good candidate set. Further improvements are discussed.