Multilingual name disambiguation with semantic information

  • Authors:
  • Zornitsa Kozareva;Sonia Vázquez;Andrés Montoyo

  • Affiliations:
  • Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante;Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante;Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante

  • Venue:
  • TSD'07: Proceedings of the 10th International Conference on Text, Speech and Dialogue
  • Year:
  • 2007

Abstract

This paper studies the problem of name ambiguity, which concerns discovering the different underlying meanings behind a name. We have developed a semantic approach in which a graph-based clustering algorithm determines the sets of semantically related sentences that talk about the same name. The approach is evaluated on Bulgarian, Romanian, Spanish and English for various pairs of city, country, person and organization names. The results significantly outperform a majority-based classifier and are compared with a bigram co-occurrence approach.
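
To make the idea of graph-based sentence clustering concrete, here is a minimal sketch. It is not the authors' method: the abstract does not specify the semantic representation or the clustering algorithm, so this sketch substitutes plain bag-of-words cosine similarity and connected components of a thresholded similarity graph. The function names, the threshold value and the example sentences are all hypothetical. A helper for the majority baseline mentioned in the abstract is included for comparison.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cluster_sentences(sentences, threshold=0.5):
    """Group sentences into connected components of a similarity graph.

    Assumption: word overlap stands in for the semantic information
    used in the paper, which the abstract does not describe.
    """
    vectors = [Counter(s.lower().split()) for s in sentences]
    n = len(sentences)
    # Add an edge between i and j when their similarity reaches the threshold.
    adjacency = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if cosine(vectors[i], vectors[j]) >= threshold:
                adjacency[i].add(j)
                adjacency[j].add(i)
    # Depth-first search collects each connected component as one cluster.
    seen, clusters = set(), []
    for start in range(n):
        if start in seen:
            continue
        stack, component = [start], []
        seen.add(start)
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbour in adjacency[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        clusters.append(sorted(component))
    return clusters


def majority_baseline_accuracy(gold_labels):
    """Accuracy obtained by always predicting the most frequent gold label."""
    return Counter(gold_labels).most_common(1)[0][1] / len(gold_labels)


if __name__ == "__main__":
    # Hypothetical sentences mentioning the ambiguous name "Paris".
    examples = [
        "Paris is the capital of France",
        "The capital of France is Paris",
        "Paris Hilton attended the party",
        "Paris Hilton left the party early",
    ]
    print(cluster_sentences(examples))
    print(majority_baseline_accuracy(["city", "city", "person", "city"]))
```

Connected components are the simplest graph clustering choice and are sensitive to the threshold: a single spurious edge merges two senses. A real system would likely need a richer similarity measure and a more robust clustering criterion.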