Type less, find more: fast autocompletion search with a succinct index

Authors:
Holger Bast; Ingmar Weber
Affiliations:
Max-Planck-Institut für Informatik, Saarbrücken, Germany; Max-Planck-Institut für Informatik, Saarbrücken, Germany
Venue:
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Year:
2006



Abstract

We consider the following autocompletion feature for full-text search. Imagine a user of a search engine typing a query. With every letter typed, we would like an instant display of those completions of the last query word that would lead to good hits. At the same time, the best hits for any of these completions should be displayed. Known indexing data structures that apply to this problem either incur large processing times for a substantial class of queries or use a lot of space. We present a new indexing data structure that uses no more space than a state-of-the-art compressed inverted index but processes queries 10 times faster. Even on the large TREC Terabyte collection, which comprises over 25 million documents, we achieve average response times of one tenth of a second on a single machine with the index on disk. We have built a full-fledged, interactive search engine that realizes the proposed autocompletion feature and combines it with support for proximity search, semi-structured (XML) text, subword and phrase completion, and semantic tags.
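
To make the problem statement concrete, the following is a minimal sketch of the autocompletion query answered with a plain, uncompressed inverted index. It is not the new index proposed in the paper, whose details the abstract does not give. The function names (`build_index`, `autocomplete`), whitespace tokenization, the `"\uffff"` sentinel for the end of the prefix range, and ranking completions by the number of matching documents are all assumptions made for illustration.

```python
from bisect import bisect_left


def build_index(docs):
    """Map each word to the sorted list of ids of the documents containing it."""
    index = {}
    for doc_id, text in enumerate(docs):
        for word in set(text.lower().split()):
            index.setdefault(word, []).append(doc_id)
    return index


def autocomplete(index, vocabulary, query):
    """Return (completions, hits) for a query whose last word is a prefix.

    `vocabulary` must be the sorted list of all words in `index`.
    """
    words = query.lower().split()
    if not words:
        return [], []
    *full_words, prefix = words

    # Documents matching every fully typed word (None = no restriction yet).
    candidates = None
    for word in full_words:
        postings = set(index.get(word, []))
        candidates = postings if candidates is None else candidates & postings

    # Words sharing a prefix form a contiguous range of the sorted vocabulary.
    # Assumption: no word contains a character at or above "\uffff".
    lo = bisect_left(vocabulary, prefix)
    hi = bisect_left(vocabulary, prefix + "\uffff")

    counts, hits = {}, set()
    for word in vocabulary[lo:hi]:  # one posting list per candidate completion
        docs = set(index[word])
        if candidates is not None:
            docs &= candidates
        if docs:  # keep only completions that actually lead to a hit
            counts[word] = len(docs)
            hits |= docs

    # Assumed ranking: more matching documents first, ties broken alphabetically.
    completions = sorted(counts, key=lambda w: (-counts[w], w))
    return completions, sorted(hits)


docs = [
    "search engine index",
    "search engines compress",
    "compressed inverted index",
    "engine room",
]
index = build_index(docs)
vocabulary = sorted(index)
print(autocomplete(index, vocabulary, "search eng"))
```

Note that this baseline reads one posting list for every vocabulary word in the prefix range, so a short prefix that matches many words makes a single query touch many lists.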