Disambiguating Personal Names on the Web using Automatically Extracted Key Phrases

  • Authors:
  • Danushka Bollegala;Yutaka Matsuo;Mitsuru Ishizuka

  • Affiliations:
  • University of Tokyo, danushka@mi.ci.i.u-tokyo.ac.jp;Japanese National Institute of Advanced Industrial Science and Technology, y.matsuo@aist.go.jp;University of Tokyo, ishizuka@i.u-tokyo.ac.jp

  • Venue:
  • Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
  • Year:
  • 2006

Abstract

When you search the Web for information about a particular person, a search engine returns many pages, some of which may refer to different people who share the same name. How can we disambiguate these people? This paper presents an unsupervised algorithm that produces unique phrases to disambiguate different people with the same name (i.e., namesakes). Our algorithm takes a personal name as input and outputs multiple sets of phrases, each of which uniquely identifies one of the namesakes on the Web. These phrases can then be added to the query to narrow the search to a specific namesake. We evaluated the algorithm on a collection of documents retrieved from the Web. Experimental results show a significant improvement over existing methods proposed for this task.
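
The query-refinement step described in the abstract can be illustrated with a minimal sketch. It covers only the final step of appending phrases to the query. It does not implement the extraction algorithm, which the abstract does not detail. The function name `refine_queries`, the quoting convention, and the example name and phrases are assumptions made for illustration and do not come from the paper.

```python
def refine_queries(name, phrase_sets):
    """Build one refined search query per namesake.

    name        -- the ambiguous personal name (hypothetical input)
    phrase_sets -- one list of key phrases per namesake, assumed to be
                   already produced by some extraction step
    """
    queries = []
    for phrases in phrase_sets:
        # Quote each phrase so a search engine treats it as an exact match
        # (an assumed convention, not something the abstract specifies).
        quoted = " ".join(f'"{p}"' for p in phrases)
        queries.append(f'"{name}" {quoted}'.strip())
    return queries


# Illustrative, made-up phrase sets for two namesakes.
for query in refine_queries(
    "Jim Clark",
    [["racing driver"], ["Netscape", "founder"]],
):
    print(query)
```

Each resulting query pairs the ambiguous name with one set of phrases, so the results are restricted to pages about a single namesake.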