Entity ranking in Wikipedia

Authors:
Anne-Marie Vercoustre;James A. Thom;Jovan Pehcevski
Affiliations:
INRIA, Rocquencourt, France;RMIT University, Melbourne, Australia;INRIA, Rocquencourt, France
Venue:
Proceedings of the 2008 ACM symposium on Applied computing
Year:
2008

Abstract

The traditional entity extraction problem is to extract named entities from plain text using natural language processing techniques and intensive training on large document collections. Examples of named entities include organisations, people, locations, and dates. Among the many research activities involving named entities, we are interested in entity ranking in the field of information retrieval. In this paper, we describe our approach to identifying and ranking entities from the INEX Wikipedia document collection. We first introduce the features of Wikipedia that are useful for entity identification and ranking. We then describe the principles and the architecture of our entity ranking system, and introduce our evaluation methodology. Our preliminary results show that the use of categories and the link structure of Wikipedia, together with entity examples, can significantly improve retrieval effectiveness.
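
The abstract names three sources of evidence (categories, link structure, and entity examples) without giving formulas. As a rough illustration only, and not the scoring used in the paper, the sketch below combines a full-text score with a category-overlap score and a link score derived from example entities. The `Page` class, the Jaccard overlap, the "fraction of examples linking to the candidate" measure, and the weights `alpha` and `beta` are all assumptions introduced here.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical stand-in for a Wikipedia page; not a structure from the paper."""
    title: str
    categories: set[str] = field(default_factory=set)
    links_to: set[str] = field(default_factory=set)  # titles this page links to
    text_score: float = 0.0  # assumed full-text search score in [0, 1]


def category_score(page: Page, targets: set[str]) -> float:
    """Jaccard overlap between the page's categories and the target categories."""
    union = page.categories | targets
    if not union:
        return 0.0
    return len(page.categories & targets) / len(union)


def link_score(page: Page, examples: list[Page]) -> float:
    """Fraction of example entity pages that link to this candidate page."""
    if not examples:
        return 0.0
    return sum(page.title in ex.links_to for ex in examples) / len(examples)


def rank_entities(candidates, examples, target_categories, alpha=0.4, beta=0.4):
    """Return (score, title) pairs, best first.

    Assumed combination: alpha * link + beta * category + (1 - alpha - beta) * text.
    """
    # Assumption: the examples' own categories extend the target category set.
    targets = set(target_categories)
    for ex in examples:
        targets |= ex.categories
    example_titles = {ex.title for ex in examples}

    scored = []
    for page in candidates:
        if page.title in example_titles:
            continue  # examples are already known, so they are not ranked
        score = (alpha * link_score(page, examples)
                 + beta * category_score(page, targets)
                 + (1 - alpha - beta) * page.text_score)
        scored.append((score, page.title))
    # Sort by descending score, breaking ties alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored


if __name__ == "__main__":
    france = Page("France", {"Countries"}, {"Germany"})
    pool = [Page("Germany", {"Countries"}, text_score=0.5),
            Page("Paris", {"Cities"}, text_score=0.9)]
    for score, title in rank_entities(pool, [france], {"Countries"}):
        print(f"{title}: {score:.2f}")
```

In this toy setup a candidate that shares a category with the example and is linked from it can outrank a page with a higher full-text score, which is the kind of effect the abstract attributes to categories and links.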