Exploiting Web querying for Web people search

Authors:
Rabia Nuray-Turan;Dmitri V. Kalashnikov;Sharad Mehrotra
Affiliations:
University of California, Irvine;University of California, Irvine;University of California, Irvine
Venue:
ACM Transactions on Database Systems (TODS)
Year:
2012

Citing 34
Cited 4

The Skyline Operator

Proceedings of the 17th International Conference on Data Engineering
Entity-based cross-document coreferencing using the Vector Space Model

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Correlation Clustering

Machine Learning
Disambiguating Web appearances of people in a social network

WWW '05 Proceedings of the 14th international conference on World Wide Web
A testbed for people searching strategies in the WWW

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Exploiting relationships for object consolidation

Proceedings of the 2nd international workshop on Information quality in information systems
Semantic integration in text: from ambiguous names to identifiable entities

AI Magazine - Special issue on semantic integration
Person resolution in person search results: WebHawk

Proceedings of the 14th ACM international conference on Information and knowledge management
Unsupervised personal name disambiguation

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Domain-independent data cleaning via analysis of entity-relationship graph

ACM Transactions on Database Systems (TODS)
Stanford WebBase components and applications

ACM Transactions on Internet Technology (TOIT)
Weakly supervised learning for cross-document person name disambiguation supported by information extraction

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Measuring semantic similarity between words using web search engines

Proceedings of the 16th international conference on World Wide Web
Adaptive graphical approach to entity resolution

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Shooting stars in the sky: an online algorithm for skyline queries

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Web based linkage

Proceedings of the 9th annual ACM international workshop on Web information and data management
Improving the performance of personal name disambiguation using web directories

Information Processing and Management: an International Journal
Towards breaking the quality curse.: a web-querying approach to web people search.

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Web People Search via Connection Analysis

IEEE Transactions on Knowledge and Data Engineering
Exploiting context analysis for combining multiple entity resolution systems

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
The SemEval-2007 WePS evaluation: establishing a benchmark for the web people search task

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
CU-COMSEM: exploring rich features for unsupervised web personal name disambiguation

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
JHU1: an unsupervised approach to person name disambiguation using web snippets

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
PSNUS: web people name disambiguation by simple clustering with rich features

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
WIT: web people search disambiguation using random walks

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
Improving author coreference by resource-bounded information gathering from the web

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Computing semantic relatedness using Wikipedia-based explicit semantic analysis

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Named entity disambiguation by leveraging wikipedia semantic knowledge

Proceedings of the 18th ACM conference on Information and knowledge management
GRAPE: A Graph-Based Framework for Disambiguating People Appearances in Web Search

ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Self-tuning in graph-based reference disambiguation

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Person name disambiguation in web pages using social network, compound words and latent topics

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Person name disambiguation by bootstrapping

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Structural semantic relatedness: a knowledge-based method to named entity disambiguation

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Attribute and object selection queries on objects with probabilistic attributes

ACM Transactions on Database Systems (TODS)

Adaptive Connection Strength Models for Relationship-Based Entity Resolution

Journal of Data and Information Quality (JDIQ) - Special Issue on Entity Resolution
A unified framework for context assisted face clustering

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Towards a fair comparison between name disambiguation approaches

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Query-driven approach to entity resolution

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

Searching for people on the Web is one of the most common query types submitted to Web search engines today. However, when a person name is queried, the returned Webpages often contain documents related to several distinct namesakes who have the queried name. The task of disambiguating and finding the Webpages related to the specific person of interest is left to the user. Many Web People Search (WePS) approaches have been developed recently that attempt to automate this disambiguation process. Nevertheless, the disambiguation quality of these techniques leaves major room for improvement. In this article, we present a new WePS approach. It is based on issuing additional auxiliary queries to the Web to gain additional knowledge about the Webpages that need to be disambiguated. Thus, the approach uses the Web as an external data source by issuing queries to collect co-occurrence statistics. These statistics are used to assess the overlap of the contextual entities extracted from the Webpages. The article also proposes a methodology to make this Web querying technique efficient. Further, the article proposes an approach that is capable of combining various types of disambiguating information, including other common types of similarities, by applying a correlation clustering approach with after-clustering of singleton clusters. These properties allow the framework to get an advantage in terms of result quality over other state-of-the-art WePS techniques.