Disambiguating Personal Names on the Web using Automatically Extracted Key Phrases

Authors:
Danushka Bollegala;Yutaka Matsuo;Mitsuru Ishizuka
Affiliations:
University of Tokyo, danushka@mi.ci.i.u-tokyo.ac.jp;Japanese National Institute of Advanced Industrial Science and Technology, y.matsuo@aist.go.jp;University of Tokyo, ishizuka@i.u-tokyo.ac.jp
Venue:
Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
Year:
2006

Citing 13
Cited 9

The merge/purge problem for large databases

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet

CICLing '02 Proceedings of the Third International Conference on Computational Linguistics and Intelligent Text Processing
On clusterings-good, bad and spectral

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Automatic word sense discrimination

Computational Linguistics - Special issue on word sense disambiguation
Entity-based cross-document coreferencing using the Vector Space Model

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Extracting nested collocations

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Information-theoretic tools for mining database structure from large data sets

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Disambiguating Web appearances of people in a social network

WWW '05 Proceedings of the 14th international conference on World Wide Web
Semantic integration in text: from ambiguous names to identifiable entities

AI Magazine - Special issue on semantic integration
Unsupervised personal name disambiguation

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
POLYPHONET: an advanced social network extraction system from the web

Proceedings of the 15th international conference on World Wide Web
Finding predominant word senses in untagged text

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Name discrimination by clustering similar contexts

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing

Measuring semantic similarity between words using web search engines

Proceedings of the 16th international conference on World Wide Web
POLYPHONET: An advanced social network extraction system from the Web

Web Semantics: Science, Services and Agents on the World Wide Web
Using web information for author name disambiguation

Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
Robust estimation of Google counts for social network extraction

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
UC3M_13: disambiguation of person names based on the composition of simple bags of typed terms

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
UNN-WePS: web person search using co-present names and lexical Chains

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Clustering web people search results using fuzzy ants

Information Sciences: an International Journal
Exploiting macro and micro relations toward web intelligence

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Context similarity measure using Fuzzy Formal Concept Analysis

Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

When you search for information regarding a particular person on the web, a search engine returns many pages. Some of these pages may be for people with the same name. How can we disambiguate these different people with the same name? This paper presents an unsupervised algorithm which produces unique phrases to disambiguate different people with the same name (i.e. namesakes). Our algorithm takes in a personal name and outputs multiple sets of phrases which uniquely identify the different namesakes on the web. These phrases could then be added to the query to narrow down the search to a specific namesake. We evaluated the algorithm on a collection of documents retreived from the Web. Experimental results show a significant improvement over the existing methods proposed for this task.