A New Model of Information Content for Semantic Similarity in WordNet

  • Authors:
  • Zili Zhou;Yanna Wang;Junzhong Gu

  • Affiliations:
  • -;-;-

  • Venue:
  • FGCNS '08 Proceedings of the 2008 Second International Conference on Future Generation Communication and Networking Symposia - Volume 03
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Information Content(IC) is an important dimension of assessing the semantic similarity between two terms or word senses in word knowledge. The conventional method of obtaining IC of word senses is to combine knowledge of their hierarchical structure from an ontology like WordNet with actual usage in text as derived from a large corpus. In this paper, a new model of IC is presented, which relies on hierarchical structure alone. The model considers not only the hyponyms of each word sense but also its depth in the structure. The IC value is easier to calculate based on our model, and when used as the basis of a similarity approach it yields judgments that correlate more closely with human assessments than others, which using IC value obtained only considering the hyponyms and IC value got by employing corpus analysis.