Related entity finding is the task of returning a ranked list of homepages of relevant entities of a specified type that engage in a given relationship with a given source entity. We propose a framework for addressing this task and perform a detailed analysis of four core components: co-occurrence models, type filtering, context modeling, and homepage finding. Our initial focus is on recall. We analyze the performance of a model that uses only co-occurrence statistics. While this method identifies the potential set of related entities, it fails to rank them effectively. Two types of error emerge: (1) entities of the wrong type pollute the ranking, and (2) some retrieved entities, while associated with the source entity in some way, do not engage in the right relation with it. To address (1), we add type filtering based on category information available in Wikipedia. To correct for (2), we complement our related entity finding method with contextual information, represented as language models derived from documents in which source and target entities co-occur. To complete the pipeline, we find homepages of top-ranked entities by combining a language modeling approach with heuristics based on Wikipedia's external links. Our method achieves very high recall scores on the end-to-end task, providing a solid starting point for expanding our focus to improve precision. Our framework can effectively incorporate additional heuristics, and these extensions lead to state-of-the-art performance.
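As a rough illustration of the first two components (co-occurrence ranking followed by type filtering), here is a minimal Python sketch. It is not the estimator used in this work: the score is assumed to be the fraction of source-entity documents that also mention the candidate, and the function name `rank_related`, the toy documents, and the category labels are all hypothetical.

```python
from collections import Counter


def rank_related(docs, source, categories, target_type):
    """Rank candidates by co-occurrence with `source`, keeping only `target_type`.

    docs: iterable of sets of entity names mentioned in each document.
    categories: dict mapping an entity name to a set of type labels
                (a stand-in for Wikipedia category information).
    The score count(source, e) / count(source) is an assumed estimator.
    """
    cooc = Counter()
    n_source = 0
    for entities in docs:
        if source in entities:
            n_source += 1
            for entity in entities:
                if entity != source:
                    cooc[entity] += 1
    if n_source == 0:
        return []
    # Type filtering: drop candidates that lack the requested type label.
    scored = [
        (entity, count / n_source)
        for entity, count in cooc.items()
        if target_type in categories.get(entity, set())
    ]
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Toy data, invented purely for illustration.
docs = [
    {"Source", "A", "B"},
    {"Source", "A"},
    {"Source", "C"},
    {"A", "C"},
]
categories = {"A": {"organization"}, "B": {"person"}, "C": {"organization"}}
ranking = rank_related(docs, "Source", categories, "organization")
```

In this toy setup, the entity "B" co-occurs with the source but is removed by the type filter, which mirrors error type (1) above. Error type (2) would need the contextual language models, which this sketch does not attempt.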