The proliferation of online information resources increases the importance of effective and efficient distributed searching. Distributed searching is cast in three parts: database selection, query processing, and results merging. In this paper we examine the effect of database selection on retrieval performance. We look at retrieval performance in three different distributed retrieval testbeds and distill some general results. First, we find that good database selection can result in better retrieval effectiveness than can be achieved in a centralized database. Second, we find that good performance can be achieved when only a few sites are selected, and that performance generally increases as more sites are selected. Finally, we find that when database selection is employed, it is not necessary to maintain collection-wide information (CWI), e.g. global idf; local information can be used to achieve superior performance. This means that distributed systems can be engineered with more autonomy and less cooperation. This work suggests that improvements in database selection can lead to broader improvements in retrieval performance, even in centralized (i.e. single-database) systems. Given a centralized database and a good selection mechanism, retrieval performance can be improved by decomposing that database conceptually and employing a selection step.
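The three-part pipeline described above (database selection, query processing with local statistics, results merging) can be illustrated with a minimal Python sketch. This is not the selection or ranking method evaluated in the paper: the collection score (fraction of documents containing each query term), the tf-idf variant, the merge by raw score, and all function names are assumptions chosen only to keep the example small.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


def select_collections(query, collections, k):
    """Database selection: rank collections by an illustrative score,
    the summed fraction of documents that contain each query term."""
    terms = set(tokenize(query))
    scores = {}
    for name, docs in collections.items():
        doc_terms = [set(tokenize(d)) for d in docs]
        size = len(docs) or 1  # avoid division by zero for empty collections
        scores[name] = sum(
            sum(t in dt for dt in doc_terms) / size for t in terms
        )
    ranked = sorted(scores, key=lambda n: (-scores[n], n))
    return ranked[:k]


def search_local(query, docs):
    """Query processing: tf-idf where idf is computed from this
    collection only (local information, no global idf)."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    df = Counter(t for toks in tokenized for t in set(toks))
    terms = set(tokenize(query))
    results = []
    for doc, toks in zip(docs, tokenized):
        tf = Counter(toks)
        score = sum(
            tf[t] * math.log(1 + n / df[t]) for t in terms if t in tf
        )
        if score > 0:
            results.append((score, doc))
    return results


def distributed_search(query, collections, k=1, top_n=3):
    """Select k collections, search each locally, merge by raw score."""
    selected = select_collections(query, collections, k)
    merged = []
    for name in selected:
        for score, doc in search_local(query, collections[name]):
            merged.append((score, name, doc))
    merged.sort(key=lambda r: (-r[0], r[1], r[2]))
    return selected, merged[:top_n]


if __name__ == "__main__":
    # Toy data, invented for illustration.
    toy = {
        "sports": ["football match result", "tennis match today"],
        "finance": ["stock market report", "bond market rates today"],
    }
    print(distributed_search("market report", toy, k=1))
```

Treating a single centralized database as several conceptual collections, as the last sentence of the abstract suggests, would amount to partitioning its documents into the `collections` mapping before calling `distributed_search`. Merging by raw local scores is the simplest possible choice here and is only reasonable in this sketch because all collections use the same scoring function.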