An extended relational document retrieval model
Information Processing and Management: an International Journal - The Potential for Improvments in Commerical Document Retrieval Systems
Supporting full-text information retrieval with a persistent object store
EDBT '94 Proceedings of the 4th international conference on extending database technology: Advances in database technology
Incremental updates of inverted lists for text document retrieval
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Inverted File Partitioning Schemes in Multiple Disk Systems
IEEE Transactions on Parallel and Distributed Systems
In situ generation of compressed inverted files
Journal of the American Society for Information Science
Dissemination of collection wide information in a distributed information retrieval system
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Fast evaluation of structured queries for information retrieval
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Self-indexing inverted files for fast text retrieval
ACM Transactions on Information Systems (TOIS)
Resource scheduling for parallel database and scientific applications
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
A design of a distributed full text retrieval system
Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval
Query performance for tightly coupled distributed digital libraries
Proceedings of the third ACM conference on Digital libraries
Compressed inverted files with reduced decoding overheads
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Inquirus, the NECI meta search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Efficient distributed algorithms to build inverted files
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Suffix arrays: a new method for on-line string searches
SODA '90 Proceedings of the first annual ACM-SIAM symposium on Discrete algorithms
Managing gigabytes (2nd ed.): compressing and indexing documents and images
Managing gigabytes (2nd ed.): compressing and indexing documents and images
WebBase: a repository of Web pages
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Signature files: an access method for documents and its analytical performance evaluation
ACM Transactions on Information Systems (TOIS)
PDIS '93 Proceedings of the second international conference on Parallel and distributed information systems
Database System Implementation
Database System Implementation
Query processing and inverted indices in shared: nothing text document information retrieval systems
The VLDB Journal — The International Journal on Very Large Data Bases - Parallelism in database systems
The Evolution of the Web and Implications for an Incremental Crawler
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
An Efficient Indexing Technique for Full Text Databases
VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Fast Incremental Indexing for Full-Text Information Retrieval
VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Methodologies for Distributed Information Retrieval
ICDCS '98 Proceedings of the The 18th International Conference on Distributed Computing Systems
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Concept-based querying in mediator systems
The VLDB Journal — The International Journal on Very Large Data Bases
Improving collection selection with overlap awareness in P2P search engines
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Kalchas: a dynamic XML search engine
Proceedings of the 14th ACM international conference on Information and knowledge management
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Load balancing for term-distributed parallel retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Architecture of a grid-enabled Web search engine
Information Processing and Management: an International Journal
Efficient in-memory extensible inverted file
Information Systems
Pruning policies for two-tiered inverted index with correctness guarantee
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Locality-Based pruning methods for web search
ACM Transactions on Information Systems (TOIS)
Structural optimization of a full-text n-gram index using relational normalization
The VLDB Journal — The International Journal on Very Large Data Bases
Collection selection: ...now, with more documents!
Proceedings of the 3rd international conference on Scalable information systems
A search-based method for forecasting ad impression in contextual advertising
Proceedings of the 18th international conference on World wide web
On the feasibility of multi-site web search engines
Proceedings of the 18th ACM conference on Information and knowledge management
Leveraging a scalable row store to build a distributed text index
Proceedings of the first international workshop on Cloud data management
Aggregation of Document Frequencies in Unstructured P2P Networks
WISE '09 Proceedings of the 10th International Conference on Web Information Systems Engineering
Semplore: an IR approach to scalable hybrid query of semantic web data
ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
Mining Query Logs: Turning Search Usage Data into Knowledge
Foundations and Trends in Information Retrieval
Efficient answering of set containment queries for skewed item distributions
Proceedings of the 14th International Conference on Extending Database Technology
A scalable real-time search engine for fast retrieval of social media content
Proceedings of the 2nd international workshop on Ubiquitous crowdsouring
On the usage of global document occurrences in peer-to-peer information systems
OTM'05 Proceedings of the 2005 Confederated international conference on On the Move to Meaningful Internet Systems - Volume >Part I
The MINERVA project: towards collaborative search in digital libraries using peer-to-peer technology
DELOS'04 Proceedings of the 6th Thematic conference on Peer-to-Peer, Grid, and Service-Orientation in Digital Library Architectures
Towards a distributed search engine
CIAC'10 Proceedings of the 7th international conference on Algorithms and Complexity
A fast algorithm for constructing inverted files on heterogeneous platforms
Journal of Parallel and Distributed Computing
A plagiarism detection system for arabic text-based documents
PAISI'12 Proceedings of the 2012 Pacific Asia conference on Intelligence and Security Informatics
Maguro, a system for indexing and searching over very large text collections
Proceedings of the sixth ACM international conference on Web search and data mining
Hi-index | 0.00 |
We identify crucial design issues in building a distributed inverted index for a large collection of Web pages. We introduce a novel pipelining technique for structuring the core index-building system that substantially reduces the index construction time. We also propose a storage scheme for creating and managing inverted files using an embedded database system. We suggest and compare different strategies for collecting global statistics from distributed inverted indexes. Finally, we present performance results from experiments on a testbed distributed Web indexing system that we have implemented.