Using wiktionary for computing semantic relatedness

  • Authors:
  • Torsten Zesch;Christof Müller;Iryna Gurevych

  • Affiliations:
  • Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany;Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany;Ubiquitous Knowledge Processing Lab, Computer Science Department, Technische Universität Darmstadt, Darmstadt, Germany

  • Venue:
  • AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce Wiktionary as an emerging lexical semantic resource that can be used as a substitute for expert-made resources in AI applications. We evaluate Wiktionary on the pervasive task of computing semantic relatedness for English and German by means of correlation with human rankings and solving word choice problems. For the first time, we apply a concept vector based measure to a set of different concept representations like Wiktionary pseudo glosses, the first paragraph of Wikipedia articles, English WordNet glosses, and GermaNet pseudo glosses. We show that: (i) Wiktionary is the best lexical semantic resource in the ranking task and performs comparably to other resources in the word choice task, and (ii) the concept vector based approach yields the best results on all datasets in both evaluations.