A Web-Based Relatedness Measure by Conditional Query

  • Authors:
  • Ming-Shun Lin;Hsin-Hsi Chen

  • Affiliations:
  • -;-

  • Venue:
  • WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper defines a novel relatedness measure by conditional query, explores snippets in various web domains as corpora, and evaluates the relatedness measure on three famous benchmarks, including WordSimilarity-353, Miller-Charles and Rubenstein-Goodenough datasets. Conditional query QY|X on a web domain estimates frequency fY|X by querying Y to search engine results of X. Dependency score is in terms of frequencies fY|X and fX|Y, and content overlap of search results of X and Y by various operations. A transfer function projects dependency score to mutual dependency of X and Y. Two transfer functions based on Poisson and Gompertz models are considered. Gompertz model reports the correlation score 0.706 in the WordSimilarity-353 dataset. Gompertz model also shows the best performance among all the web-based approaches in Rubenstein-Goodenough and Miller-Charles datasets.