Database performance evaluation in an indexed file environment
ACM Transactions on Database Systems (TODS)
Communications of the ACM
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
A generic machine for parallel information retrieval
Information Processing and Management: an International Journal
A parallel indexed algorithm for information retrieval
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Multiple-Disk I/O Systems
IEEE Transactions on Computers
Partitioned posting files: a parallel inverted file structure for information retrieval
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Parallel text searching in serial files using a processor farm
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction: parallel processing and information retrieval
Information Processing and Management: an International Journal - Special issue on parallel processing and information retrieval
On the allocation of documents in multiprocessor information retrieval systems
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
The art of computer programming, volume 3: (2nd ed.) sorting and searching
The art of computer programming, volume 3: (2nd ed.) sorting and searching
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Hash-Based and Index-Based Join Algorithms for Cube and Ring Connected Multicomputers
IEEE Transactions on Knowledge and Data Engineering
A Multiuser Performance Analysis of Alternative Declustering Strategies
Proceedings of the Sixth International Conference on Data Engineering
Implementing Relational Database Operations in a Cube-Connected Multicomputer System
Proceedings of the Third International Conference on Data Engineering
Performance of Inverted Indices in Distributed Text Document Retrieval Systems
PDIS '93 Proceedings of the 2nd International Conference on Parallel and Distributed Information Systems
GAMMA - A High Performance Dataflow Database Machine
VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
An Analysis of Three Transaction Processing Architectures
VLDB '88 Proceedings of the 14th International Conference on Very Large Data Bases
Performance Analysis of a Load Balancing Hash-Join Algorithm for a Shared Memory Multiprocessor
VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Performance evaluation of a distributed architecture for information retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Query performance for tightly coupled distributed digital libraries
Proceedings of the third ACM conference on Digital libraries
ACM Transactions on Information Systems (TOIS)
Building a distributed full-text index for the Web
Proceedings of the 10th international conference on World Wide Web
Building a distributed full-text index for the web
ACM Transactions on Information Systems (TOIS)
Hybrid Partition Inverted Files: Experimental Validation
ECDL '02 Proceedings of the 6th European Conference on Research and Advanced Technology for Digital Libraries
PLIERS: A Parallel Information Retrieval System Using MPI
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Optimizing result prefetching in web search engines with segmented indices
ACM Transactions on Internet Technology (TOIT)
A case study of distributed information retrieval architectures to index one terabyte of text
Information Processing and Management: an International Journal
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Load balancing for term-distributed parallel retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Stanford WebBase components and applications
ACM Transactions on Internet Technology (TOIT)
Efficient in-memory extensible inverted file
Information Systems
A pipelined architecture for distributed text query evaluation
Information Retrieval
Information Processing and Management: an International Journal
Optimizing result prefetching in web search engines with segmented indices
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Optimized query execution in large search engines with global page ordering
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Mining query logs to optimize index partitioning in parallel web search engines
Proceedings of the 2nd international conference on Scalable information systems
Two-Dimensional Distributed Inverted Files
SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
A case study of distributed information retrieval architectures to index one terabyte of text
Information Processing and Management: an International Journal
Performance comparison of clustered and replicated information retrieval systems
ECIR'07 Proceedings of the 29th European conference on IR research
Scalable online index construction with multi-core CPUs
ADC '10 Proceedings of the Twenty-First Australasian Conference on Database Technologies - Volume 104
Load and storage balanced posting file partitioning for parallel information retrieval
Journal of Systems and Software
On-line multi-threaded processing of web user-clicks on multi-core processors
VECPAR'10 Proceedings of the 9th international conference on High performance computing for computational science
An evaluation of fault-tolerant query processing for web search engines
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
ISCIS'06 Proceedings of the 21st international conference on Computer and Information Sciences
Replicated partitioning for undirected hypergraphs
Journal of Parallel and Distributed Computing
Fast concurrency control for distributed inverted files
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Scalable search platform: improving pipelined query processing for distributed full-text retrieval
Proceedings of the 21st international conference companion on World Wide Web
An investigation into query throughput and load balance using grid IR
FDIA'08 Proceedings of the 2nd BCS IRSG conference on Future Directions in Information Access
(Sync|Async)+ MPI search engines
PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
A term-based inverted index partitioning model for efficient distributed query processing
ACM Transactions on the Web (TWEB)
Hi-index | 0.00 |
Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.