Exploiting Wikipedia for Directional Inferential Text Similarity

  • Authors:
  • Leong Chee Wee;Samer Hassan

  • Affiliations:
  • -;-

  • Venue:
  • ITNG '08 Proceedings of the Fifth International Conference on Information Technology: New Generations
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In natural languages, variability of semantic expression refers to the situation where the same meaning can be inferred from different words or texts. Given that many natural language processing tasks nowadays (e.g. question answering, information retrieval, document summarization) often model this variability by requiring a specific targetmeaning to be inferred from different text variants, it is helpful to capture text similarity in a directional manner to serve such inference needs. In this paper, we show how Wikipedia can be used as a semantic resource to build a directional inferential similarity metric between words, and subsequently, texts. Through experiments, we show that our Wikipedia-based metric performs significantly better when applied to a standard evaluation dataset, with a reduction in error rate of 16.1% over the random metric baseline.