Combining inverted indices and structured search for ad-hoc object retrieval

Authors:
Alberto Tonon;Gianluca Demartini;Philippe Cudré-Mauroux
Affiliations:
University of Fribourg, Fribourg, Switzerland;University of Fribourg, Fribourg, Switzerland;University of Fribourg, Fribourg, Switzerland
Venue:
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Year:
2012

Abstract

Retrieving semi-structured entities to answer keyword queries is an increasingly important feature of many modern Web applications. The fast-growing Linked Open Data (LOD) movement makes it possible to crawl and index very large amounts of structured data describing hundreds of millions of entities. However, entity retrieval approaches have yet to find efficient and effective ways of ranking and navigating through those large data sets. In this paper, we address the problem of Ad-hoc Object Retrieval over large-scale LOD by proposing a hybrid approach that combines IR and structured search techniques. Specifically, we propose an architecture that uses an inverted index to answer keyword queries and a semi-structured database to improve search effectiveness by automatically generating queries over the LOD graph. Experimental results show that our ranking algorithms, which exploit both IR and graph indices, outperform state-of-the-art entity retrieval techniques, improving on the BM25 baseline by up to 25%.
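
The hybrid idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper generates structured queries over the LOD graph, whereas this toy version assumes a plain Python dictionary as the graph, a hypothetical three-entity collection, and a simple additive score propagation controlled by a made-up `weight` parameter. The function names (`build_index`, `bm25`, `hybrid_rank`) are likewise illustrative only.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build a toy inverted index: term -> {entity_id: term frequency}."""
    index = defaultdict(dict)
    lengths = {}
    for entity_id, text in docs.items():
        tokens = text.lower().split()
        lengths[entity_id] = len(tokens)
        for term, tf in Counter(tokens).items():
            index[term][entity_id] = tf
    return index, lengths


def bm25(query, index, lengths, k1=1.2, b=0.75):
    """Score entities against a keyword query with a standard BM25 formula."""
    n = len(lengths)
    avg_len = sum(lengths.values()) / n
    scores = defaultdict(float)
    for term in query.lower().split():
        postings = index.get(term, {})
        if not postings:
            continue
        idf = math.log(1 + (n - len(postings) + 0.5) / (len(postings) + 0.5))
        for entity_id, tf in postings.items():
            norm = tf + k1 * (1 - b + b * lengths[entity_id] / avg_len)
            scores[entity_id] += idf * tf * (k1 + 1) / norm
    return dict(scores)


def hybrid_rank(query, index, lengths, graph, top_k=2, weight=0.5):
    """Rank by BM25, then add graph neighbours of the top-k text results.

    Assumption: each neighbour receives `weight` times the BM25 score of the
    seed entity it is linked from. This stands in for the structured queries
    described in the abstract and is only one of many possible schemes.
    """
    text_scores = bm25(query, index, lengths)
    seeds = sorted(text_scores, key=text_scores.get, reverse=True)[:top_k]
    final = dict(text_scores)
    for seed in seeds:
        for neighbour in graph.get(seed, ()):
            final[neighbour] = final.get(neighbour, 0.0) + weight * text_scores[seed]
    return sorted(final.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: entity descriptions and one graph link.
docs = {
    "e1": "university of fribourg switzerland",
    "e2": "fribourg city",
    "e3": "research lab",
}
graph = {"e1": ["e3"]}

index, lengths = build_index(docs)
ranked = hybrid_rank("fribourg", index, lengths, graph)
```

In this toy run, `e3` shares no term with the query and so never appears in the BM25 results; it enters the final ranking only because it is linked to `e1` in the graph, which is the kind of result a purely text-based index would miss.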