Various approaches to text representation for named entity disambiguation

  • Authors:
  • Ivo Lašek;Peter Vojtáš

  • Affiliations:
  • Czech Technical University in Prague and Charles University in Prague, Prague, Czech Republic;Charles University in Prague, Prague, Czech Republic

  • Venue:
  • Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we focus on the problem of named entity disambiguation. We disambiguate named entities on a very detailed level. To each entity is assigned a concrete identifier of a corresponding Wikipedia article describing the entity. For such a fine grained disambiguation a correct representation of a context is crucial. We compare various context representations: bag of words representation, linguistic representation and structured co-occurrence representation of the context. Models for each representation are described and evaluated.