Proximity-based document representation for named entity retrieval

Authors:
Desislava Petkova;W. Bruce Croft
Affiliations:
University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA
Venue:
Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management
Year:
2007

Abstract

Retrieving named entities differs from retrieving documents in that the items to be retrieved (persons, locations, organizations) are only indirectly described by documents throughout the collection. Much work has been dedicated to finding references to named entities, in particular to the problems of named entity extraction and disambiguation. However, just as important for retrieval performance is how these snippets of text are combined to build named entity representations. We focus on the TREC expert search task, where the goal is to identify people who are knowledgeable on a specific topic. Existing language modeling techniques for expert finding assume that terms and person entities are conditionally independent given a document. We present theoretical and experimental evidence that this simplifying assumption ignores information on how named entities relate to document content. To address this issue, we propose a new document representation that emphasizes text in proximity to entities and thus incorporates sequential information implicit in text. Our experiments demonstrate that the proposed model significantly improves retrieval performance. The main contribution of this work is an effective formal method for explicitly modeling the dependency between the named entities and the terms that appear in a document.
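The idea of emphasizing text in proximity to an entity can be illustrated with a small sketch. The abstract does not state the weighting formula, so everything here is an assumption made for illustration: the Gaussian kernel, the `sigma` parameter, the function name `proximity_term_weights`, and the convention that an entity mention is a single token. It should not be read as the authors' model.

```python
import math
from collections import defaultdict


def proximity_term_weights(tokens, entity, sigma=2.0):
    """Return a normalized term distribution for `entity` in one document.

    Each non-entity token contributes a weight that decays with its distance
    (in token positions) to the nearest mention of `entity`. The Gaussian
    decay is an illustrative assumption, not a formula taken from the paper.
    """
    positions = [i for i, tok in enumerate(tokens) if tok == entity]
    if not positions:
        return {}  # the entity is not mentioned in this document

    weights = defaultdict(float)
    for i, tok in enumerate(tokens):
        if tok == entity:
            continue  # skip the entity mention itself
        dist = min(abs(i - p) for p in positions)
        weights[tok] += math.exp(-(dist ** 2) / (2 * sigma ** 2))

    total = sum(weights.values())
    if total == 0:
        return {}  # the document contains only entity mentions
    return {term: w / total for term, w in weights.items()}


# Hypothetical document with one entity mention marked as a single token.
doc = "ENTITY_smith works on retrieval models while the cafeteria serves lunch".split()
dist = proximity_term_weights(doc, "ENTITY_smith")
for term in sorted(dist, key=dist.get, reverse=True):
    print(term, dist[term])
```

Under a conditional independence assumption, every term in the document would be associated with the entity equally. In this sketch, terms next to the mention dominate the distribution, and terms far from it contribute very little.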