Filtered document retrieval with frequency-sorted indexes
Journal of the American Society for Information Science
Scaling question answering to the Web
Proceedings of the 10th international conference on World Wide Web
Performing Group-By before Join
Proceedings of the Tenth International Conference on Data Engineering
Aggregate-Join Query Processing in Parallel Database Systems
HPC '00 Proceedings of the The Fourth International Conference on High-Performance Computing in the Asia-Pacific Region-Volume 2 - Volume 2
Question answering from the web using knowledge annotation and knowledge mining techniques
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Web-scale information extraction in knowitall: (preliminary results)
Proceedings of the 13th international conference on World Wide Web
Fast phrase querying with combined indexes
ACM Transactions on Information Systems (TOIS)
Three-level caching for efficient query processing in large Web search engines
WWW '05 Proceedings of the 14th international conference on World Wide Web
A search engine for natural language applications
WWW '05 Proceedings of the 14th international conference on World Wide Web
Performance analysis of "Groupby-After-Join" query processing in parallel database systems
Information Sciences—Informatics and Computer Science: An International Journal
An analysis of the AskMSR question-answering system
EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Optimizing scoring functions and indexes for proximity search in type-annotated corpora
Proceedings of the 15th international conference on World Wide Web
Ranking objects based on relationships
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Avatar semantic search: a database approach to information retrieval
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Proceedings of the 16th international conference on World Wide Web
How NAGA uncoils: searching with entities and relations
Proceedings of the 16th international conference on World Wide Web
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Optimized query execution in large search engines with global page ordering
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
EntityRank: searching entities directly and holistically
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Performance of compressed inverted list caching in search engines
Proceedings of the 17th international conference on World Wide Web
Data-oriented content query system: searching for data into text on the web
Proceedings of the third ACM international conference on Web search and data mining
On caching search engine query results
Computer Communications
BOSS: a biomedical object search system
Proceedings of the ACM fifth international workshop on Data and text mining in biomedical informatics
Compressed data structures for annotated web search
Proceedings of the 21st international conference on World Wide Web
Hi-index | 0.00 |
Entity search, a significant departure from page-based retrieval, finds data, i.e., entities, embedded in documents directly and holistically across the whole collection. This paper aims at distilling and abstracting the essential computation requirements of entity search. From the dual views of reasoning--entity as input and entity as output, we propose a dual-inversion framework, with two indexing and partition schemes, towards efficient and scalable query processing. We systematically evaluate our framework using a prototype over a 3TB real Web corpus with 150M pages and over 20 entity types extracted. Our experiments in two concrete application settings show our techniques of on average, 2 to 4 orders of magnitude speed-up, over the keyword-based baseline, with reasonable space overhead.