Principles of database and knowledge-base systems, Vol. I
Principles of database and knowledge-base systems, Vol. I
Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Execution performance issues in full-text information retrieval
Execution performance issues in full-text information retrieval
Optimization of inverted vector searches
SIGIR '85 Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval
Interaction of query evaluation and buffer management for information retrieval
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Information Retrieval: Algorithms and Heuristics
Information Retrieval: Algorithms and Heuristics
Reducing the Braking Distance of an SQL Query Engine
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Computing Iceberg Queries Efficiently
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Evaluating Top-k Selection Queries
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Probabilistic Optimization of Top N Queries
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
BNCOD 18 Proceedings of the 18th British National Conference on Databases: Advances in Databases
The relationship between IR and multimedia databases
IRSG'98 Proceedings of the 20th Annual BCS-IRSG conference on Information Retrieval Research
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Flexible digital library search
Web-enabled systems integration
A Selectivity Model for Fragmented Relations: Applied in Information Retrieval
IEEE Transactions on Knowledge and Data Engineering
Goal-oriented methods and meta methods for document classification and their parameter tuning
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Meta methods for model sharing in personal information systems
ACM Transactions on Information Systems (TOIS)
Hi-index | 0.00 |
Efficient, flexible, and scalable integration of full text information retrieval (IR) in a DBMS is not a trivial case. This holds in particular for query optimization in such a context. To facilitate the bulk-oriented behavior of database query processing, a priori knowledge of how to limit the data efficiently prior to query evaluation is very valuable at optimization time. The usually imprecise nature of IR querying provides an extra opportunity to limit the data by a trade-off with the quality of the answer. In this paper we present a mathematically derived model to predict the quality implications of neglecting information before query execution. In particular we investigate the possibility to predict the retrieval quality for a document collection for which no training information is available, which is usually the case in practice. Instead, we construct a model that can be trained on other document collections for which the necessary quality information is available, or can be obtained quite easily. We validate our model for several document collections and present the experimental results. These results show that our model performs quite well, even for the case were we did not train it on the test collection itself.