Using WordNet to Measure the Similarity of Link Texts

  • Authors:
  • Andrzej Siemiński

  • Affiliations:
  • Institute of Informatics, Wrocław University of Technology, Wroclaw, Poland 50-370

  • Venue:
  • ICCCI '09 Proceedings of the 1st International Conference on Computational Collective Intelligence. Semantic Web, Social Networks and Multiagent Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The primary aim of the study is to evaluate the extend to which the introduction of word similarity defined by the WordNet database could improve the link texts similarity assessment. The proper assessment is crucial for focused crawlers. The crawlers need it to select which links are to be followed. The proposed WordNet based semantic similarity algorithm has increased the recall of the link selection process. To mitigate the co-occurring loss of precision an adaptive algorithm for modifying the initial word similarity levels is introduced and evaluated. The proposed algorithms are verified by an experiment.