Topic-Oriented words as features for named entity recognition

  • Authors:
  • Ziqi Zhang;Trevor Cohn;Fabio Ciravegna

  • Affiliations:
  • Department of Computer Science, University of Sheffield, Sheffield, UK;Department of Computer Science, University of Sheffield, Sheffield, UK;Department of Computer Science, University of Sheffield, Sheffield, UK

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Research has shown that topic-oriented words are often related to named entities and can be used for Named Entity Recognition. Many have proposed to measure topicality of words in terms of …informativeness' based on global distributional characteristics of words in a corpus. However, this study shows that there can be large discrepancy between informativeness and topicality; empirically, informativeness based features can damage learning accuracy of NER. This paper proposes to measure words' topicality based on local distributional features specific to individual documents, and proposes methods to transform topicality into gazetteer-like features for NER by binning. Evaluated using five datasets from three domains, the methods have shown consistent improvement over a baseline by between 0.9 and 4.0 in F-measure, and always outperformed methods that use informativeness measures.