Integrating Document Features for Entity Ranking

Authors:
Jianhan Zhu;Dawei Song;Stefan Rüger
Affiliations:
Knowledge Media Institute, The Open University, United Kingdom;Knowledge Media Institute, The Open University, United Kingdom;Knowledge Media Institute, The Open University, United Kingdom
Venue:
Focused Access to XML Documents
Year:
2008

Citing 1
Cited 4

A system for discovering relationships by feature extraction from text databases

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval

Using Wikipedia Categories and Links in Entity Ranking

Focused Access to XML Documents
Query modeling for entity search based on terms, categories, and examples

ACM Transactions on Information Systems (TOIS)
Category-based query modeling for entity search

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Expertise Retrieval

Foundations and Trends in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Knowledge Media Institute of the Open University participated in the entity ranking and entity list completion tasks of the Entity Ranking Track in INEX 2007. In both the entity ranking and entity list completion tasks, we have considered document features in addition to a basic document content based relevance model. These document features include categorizations of documents, relevance of category names to the query, and hierarchical relations between categories. Furthermore, based on our TREC2006 and 2007 expert search approach, we applied a co-occurrence based entity association discovery model to the two tasks based on the assumption that relevant entities often co-occur with query terms or given relevant entities in documents. Our initial experimental results show that, by considering the predefined category, its children and grandchildren in the document content based relevance model, the performance of our entity ranking approach can be significantly improved. Consideration of the predefined category's parents, a category name based relevance model, and the co-occurrence model is not shown to be helpful in entity ranking and list completion, respectively.