Wide-Coverage Spanish Named Entity Extraction

  • Authors:
  • Xavier Carreras;Lluís Màrquez;Lluís Padró

  • Affiliations:
  • -;-;-

  • Venue:
  • IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a proposal for wide-coverage Named Entity-Extraction for Spanish. The extraction of named entities is treated using robust Machine Learning techniques (AdaBoost) and simple attributes requiring non-linguistically processed corpora, complemented with external information sources (a list of trigger words and a gazetteer). A thorough evaluation of the task on real corpora is presented in order to validate the appropriateness of the approach. The non linguistic nature of used features makes the approach easily portable to other languages.