A WordNet-based approach to Named Entities recognition

  • Authors:
  • Bernardo Magnini;Matteo Negri;Roberto Prevete;Hristo Tanev

  • Affiliations:
  • Centro per la Ricerca Scientifica e Tecnologica;Centro per la Ricerca Scientifica e Tecnologica;Centro per la Ricerca Scientifica e Tecnologica;Centro per la Ricerca Scientifica e Tecnologica

  • Venue:
  • SEMANET '02 Proceedings of the 2002 workshop on Building and using semantic networks - Volume 11
  • Year:
  • 2002

Abstract

This paper presents a Named Entities (NE) recognition system for the English written language, which combines the wealth of the WordNet taxonomy and the effectiveness of traditional rule-based approaches. The core of the system relies on the combination of approximately 200 language-dependent rules with a set of predicates, defined on the WordNet hierarchy, for the identification of both proper nouns and trigger words. The strengths of this approach are twofold. First, the use of a semantic network allows it to cope with the difficulty of building and maintaining extensive gazetteers. Second, considering the recent spread of WordNet-like semantic networks for languages other than English and aligned with the English version, the use of language-independent predicates offers a useful basis for achieving multilinguality.
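As a rough illustration of how a predicate defined on a hypernym hierarchy can be combined with a rule, the following minimal sketch may help. It is not the authors' implementation. The `HYPERNYMS` table is a tiny hand-made stand-in for WordNet, and the names `is_a`, `trigger_class`, and `tag_entities`, the three labels, and the single rule (a trigger word immediately before a capitalized token) are all assumptions made for this example. It covers only the trigger-word side of the approach, not the predicates for proper nouns.

```python
# Toy hypernym links standing in for the WordNet hierarchy.
# Every entry here is an illustrative assumption, not real WordNet data.
HYPERNYMS = {
    "river": "stream",
    "stream": "body_of_water",
    "body_of_water": "location",
    "city": "location",
    "president": "leader",
    "leader": "person",
    "company": "organization",
}

# Hypothetical entity classes, each named after a top node of the toy hierarchy.
LABELS = ("location", "person", "organization")


def is_a(word, ancestor):
    """Predicate: True if `ancestor` is reached by following hypernym links from `word`."""
    node = word.lower()
    seen = set()  # guards against accidental cycles in the table
    while node is not None and node not in seen:
        if node == ancestor:
            return True
        seen.add(node)
        node = HYPERNYMS.get(node)
    return False


def trigger_class(word):
    """Return the entity label a trigger word points to, or None if it is not a trigger."""
    for label in LABELS:
        if is_a(word, label):
            return label.upper()
    return None


def tag_entities(tokens):
    """Hypothetical rule: a capitalized token directly after a trigger word takes its label."""
    entities = []
    for i, tok in enumerate(tokens):
        if i > 0 and tok[:1].isupper():
            label = trigger_class(tokens[i - 1])
            if label is not None:
                entities.append((tok, label))
    return entities


tokens = ["the", "river", "Adige", "flows", "near", "President", "Smith"]
print(tag_entities(tokens))
# prints [('Adige', 'LOCATION'), ('Smith', 'PERSON')]
```

Because the rule refers only to the predicate and never to specific words, replacing the toy table with an aligned semantic network for another language would leave the rule unchanged. This is the property the abstract points to as a basis for multilinguality.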