This paper addresses the problem of using a broker to select a subset of the available information servers so as to achieve a good trade-off between document retrieval effectiveness and cost. It investigates server selection methods that can operate in the absence of global information and where servers have no knowledge of brokers. A novel method using Lightweight Probe queries (the LWP method) is compared with several methods based on data from past query processing, with Random and Optimal server rankings serving as controls. Methods are evaluated, using TREC data and relevance judgments, by computing ratios, both empirical and ideal, of recall and early precision for the selected subset versus the complete set of available servers. Estimates are also made of the best possible performance of each method. The LWP and Topic Similarity methods achieved the best results, each retrieving about 60% of the relevant documents for only one-third of the cost of querying all servers. Subject to the applicable cost model, the LWP method is likely to be preferred because it is suited to dynamic environments. The good results obtained with a simple automatic LWP implementation were replicated using different data and a larger set of query topics.
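The subset-versus-all-servers comparison can be illustrated with a minimal Python sketch. It is not the paper's evaluation code: the function names, the toy server contents, and the uniform per-server cost are all assumptions made for illustration. It computes an ideal-style recall ratio, assuming each selected server returns every relevant document it holds, together with the fraction of total cost incurred by querying only the selected servers.

```python
def recall_ratio(relevant_by_server, selected):
    """Share of all relevant documents that are held by the selected servers.

    Assumption: a selected server returns every relevant document it holds,
    so this is an ideal (best-case) ratio rather than an empirical one.
    """
    all_relevant = set().union(*relevant_by_server.values())
    if not all_relevant:
        return 0.0
    found = set().union(*(relevant_by_server[s] for s in selected))
    return len(found) / len(all_relevant)


def cost_fraction(relevant_by_server, selected):
    """Share of total cost, assuming every server costs the same to query."""
    return len(selected) / len(relevant_by_server)


# Toy data, invented for this example (not taken from the paper):
# the relevant documents held by each of four hypothetical servers.
relevant_by_server = {
    "s1": {"d1", "d2", "d3"},
    "s2": {"d4", "d5"},
    "s3": {"d6"},
    "s4": {"d7", "d8"},
}
selected = ["s1", "s2"]  # servers a broker might choose for one query

print(recall_ratio(relevant_by_server, selected))   # 0.625
print(cost_fraction(relevant_by_server, selected))  # 0.5
```

In this toy case the broker reaches five of the eight relevant documents at half the cost of querying every server. A different cost model, for example one weighted by server size or latency, would change only `cost_fraction`.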