Term clustering of syntactic phrases
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
The use of phrases and structured queries in information retrieval
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Overview of the second text retrieval conference (TREC-2)
TREC-2 Proceedings of the second conference on Text retrieval conference
Filtered document retrieval with frequency-sorted indexes
Journal of the American Society for Information Science
Self-indexing inverted files for fast text retrieval
ACM Transactions on Information Systems (TOIS)
Exploring the similarity space
ACM SIGIR Forum
Phrase recognition and expansion for short, precision-biased queries based on a query log
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Scalable browsing for large collections: a case study
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Improving browsing in digital libraries with keyphrase indexes
Decision Support Systems - From information retrieval to knowledge management: enabling technologies and best practices
Searching the Web: the public and their queries
Journal of the American Society for Information Science and Technology
A review of web searching studies and a framework for future research
Journal of the American Society for Information Science and Technology
Vector-space ranking with effective early termination
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Rank-preserving two-level caching for scalable search engines
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
In-memory hash tables for accumulating text vocabularies
Information Processing Letters
Optimised phrase querying and browsing of large text databases
ACSC '01 Proceedings of the 24th Australasian conference on Computer science
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Engineering a multi-purpose test collection for web retrieval experiments
Information Processing and Management: an International Journal
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Efficient online index maintenance for contiguous inverted lists
Information Processing and Management: an International Journal
Effective and efficient object-based image retrieval using visual phrases
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Investigating sentence weighting components for automatic summarisation
Information Processing and Management: an International Journal
Heavy-tailed distributions and multi-keyword queries
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Constructing visual phrases for effective and efficient object-based image retrieval
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Can phrase indexing help to process non-phrase queries?
Proceedings of the 17th ACM conference on Information and knowledge management
Out of the Box Phrase Indexing
SPIRE '08 Proceedings of the 15th International Symposium on String Processing and Information Retrieval
Top-k aggregation using intersections of ranked inputs
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Efficient interactive fuzzy keyword search
Proceedings of the 18th international conference on World wide web
Efficient type-ahead search on relational data: a TASTIER approach
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Beyond pages: supporting efficient, scalable entity search with dual-inversion index
Proceedings of the 13th International Conference on Extending Database Technology
Efficient text proximity search
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
SSRS: an XML information retrieval system
DNIS'07 Proceedings of the 5th international conference on Databases in networked information systems
Index structures for efficiently searching natural language text
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Efficient interactive smart keyword search
WISE'10 Proceedings of the 11th international conference on Web information systems engineering
Inverted indexes for phrases and strings
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Efficient fuzzy full-text type-ahead search
The VLDB Journal — The International Journal on Very Large Data Bases
Efficiently encoding term co-occurrences in inverted indexes
Proceedings of the 20th ACM international conference on Information and knowledge management
Efficient phrase querying with flat position index
Proceedings of the 20th ACM international conference on Information and knowledge management
Object-oriented XML keyword search
ER'11 Proceedings of the 30th international conference on Conceptual modeling
High-performance processing of text queries with tunable pruned term and term pair indexes
ACM Transactions on Information Systems (TOIS)
Structured index organizations for high-throughput text querying
SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
Effectively scoring for XML IR queries
DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
Supporting efficient top-k queries in type-ahead search
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Indexing Word Sequences for Ranked Retrieval
ACM Transactions on Information Systems (TOIS)
Document vector representations for feature extraction in multi-stage document ranking
Information Retrieval
Hi-index | 0.00 |
Search engines need to evaluate queries extremely fast, a challenging task given the quantities of data being indexed. A significant proportion of the queries posed to search engines involve phrases. In this article we consider how phrase queries can be efficiently supported with low disk overheads. Our previous research has shown that phrase queries can be rapidly evaluated using nextword indexes, but these indexes are twice as large as conventional inverted files. Alternatively, special-purpose phrase indexes can be used, but it is not feasible to index all phrases. We propose combinations of nextword indexes and phrase indexes with inverted files as a solution to this problem. Our experiments show that combined use of a partial nextword, partial phrase, and conventional inverted index allows evaluation of phrase queries in a quarter the time required to evaluate such queries with an inverted file alone; the additional space overhead is only 26% of the size of the inverted file.