The role of named entities in web people search

Authors:
Javier Artiles;Enrique Amigó;Julio Gonzalo
Affiliations:
UNED NLP & IR group, Madrid, Spain;UNED NLP & IR group, Madrid, Spain;UNED NLP & IR group, Madrid, Spain
Venue:
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Year:
2009

Citing 13
Cited 14

Entity-based cross-document coreferencing using the Vector Space Model

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Grouping search-engine returned citations for person-name queries

Proceedings of the 6th annual ACM international workshop on Web information and data management
A testbed for people searching strategies in the WWW

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Person resolution in person search results: WebHawk

Proceedings of the 14th ACM international conference on Information and knowledge management
Unsupervised personal name disambiguation

CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Multi-document statistical fact extraction and fusion

Multi-document statistical fact extraction and fusion
Word Sense Disambiguation: Algorithms and Applications (Text, Speech and Language Technology)

Word Sense Disambiguation: Algorithms and Applications (Text, Speech and Language Technology)
Is Hillary Rodham Clinton the president?: disambiguating names across documents

CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
The SemEval-2007 WePS evaluation: establishing a benchmark for the web people search task

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
CU-COMSEM: exploring rich features for unsupervised web personal name disambiguation

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
IRST-BP: web people search using name entities

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
TITPI: web people search task using semi-supervised clustering approach

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
UC3M_13: disambiguation of person names based on the composition of simple bags of typed terms

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations

SemEval-2010 task 14: Word sense induction & disambiguation

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Duluth-WSI: SenseClusters applied to the sense induction task of SemEval-2

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Entity linking leveraging: automatically generated annotation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A quantitative evaluation of global word sense induction

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
Latent semantic word sense induction and disambiguation

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Word sense induction by community detection

TextGraphs-6 Proceedings of TextGraphs-6: Graph-based Methods for Natural Language Processing
Unsupervised name ambiguity resolution using a generative model

EMNLP '11 Proceedings of the First Workshop on Unsupervised Learning in NLP
Measuring the impact of sense similarity on word sense induction

EMNLP '11 Proceedings of the First Workshop on Unsupervised Learning in NLP
Automatic identification of protagonist in fairy tales using verb

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
An evaluation of graded sense disambiguation using word sense induction

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
How do humans distinguish different people with identical names on the web?

Proceedings of the 21st ACM international conference on Information and knowledge management
MaxMax: a graph-based soft clustering algorithm applied to word sense induction

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Automatic dominant character identification in fables based on verb analysis - Empirical study on the impact of anaphora resolution

Knowledge-Based Systems
Evaluating Word Sense Induction and Disambiguation Methods

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

The ambiguity of person names in the Web has become a new area of interest for NLP researchers. This challenging problem has been formulated as the task of clustering Web search results (returned in response to a person name query) according to the individual they mention. In this paper we compare the coverage, reliability and independence of a number of features that are potential information sources for this clustering task, paying special attention to the role of named entities in the texts to be clustered. Although named entities are used in most approaches, our results show that, independently of the Machine Learning or Clustering algorithm used, named entity recognition and classification per se only make a small contribution to solve the problem.