On a model of distributed information retrieval systems based on thesauri
Information Processing and Management: an International Journal
Numerical recipes in C: the art of scientific computing
Numerical recipes in C: the art of scientific computing
Algorithms for clustering data
Algorithms for clustering data
Inference networks for document retrieval
SIGIR '90 Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval
Inference networks for document retrieval
Inference networks for document retrieval
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The effectiveness of GIOSS for the text database discovery problem
SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
ALIWEB—Archie-like indexing in the WEB
Selected papers of the first conference on World-Wide Web
18th International Conference on Research Development in Information Retrieval
NetSerf: using semantic knowledge to find Internet information archives
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Searching distributed collections with inference networks
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Learning collection fusion strategies
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Copy detection mechanisms for digital documents
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Query expansion using local and global document analysis
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
STARTS: Stanford proposal for Internet meta-searching
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Experiences with selecting search engines using metasearch
ACM Transactions on Information Systems (TOIS)
20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
A probabilistic model for distributed information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Analyses of multiple evidence combination
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Multiple search engines in database merging
DL '97 Proceedings of the second ACM international conference on Digital libraries
The probability ranking principle in IR
Readings in information retrieval
Syntactic clustering of the Web
Selected papers from the sixth international conference on World Wide Web
Querying multiple document collections across the Internet
Querying multiple document collections across the Internet
21st Annual ACM/SIGIR International Conference on Research and Development in Information Retrieval
Effective retrieval with distributed collections
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating database selection techniques: a testbed and experiment
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Methods for information server selection
ACM Transactions on Information Systems (TOIS)
Inquirus, the NECI meta search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
A technique for measuring the relative size and overlap of public Web search engines
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Automatic discovery of language models for text databases
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Comparing the performance of database selection algorithms
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A probabilistic solution to the selection and fusion problem in distributed information retrieval
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Cluster-based language models for distributed retrieval
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Grouper: a dynamic clustering interface to Web search results
WWW '99 Proceedings of the eighth international conference on World Wide Web
A decision-theoretic approach to database selection in networked IR
ACM Transactions on Information Systems (TOIS)
Proceedings of the eighth international conference on Information and knowledge management
Conference on Information and Knowledge Management
Architecture of a metasearch engine that supports user information needs
Proceedings of the eighth international conference on Information and knowledge management
Efficient and effective metasearch for a large number of text databases
Proceedings of the eighth international conference on Information and knowledge management
GlOSS: text-source discovery over the Internet
ACM Transactions on Database Systems (TODS)
Server selection on the World Wide Web
DL '00 Proceedings of the fifth ACM conference on Digital libraries
IR evaluation methods for retrieving highly relevant documents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
The impact of database selection on distributed searching
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Database merging strategy based on logistic regression
Information Processing and Management: an International Journal
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Query routing for Web search engines: architectures and experiments
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Collection selection and results merging with topically organized U.S. patents and TREC data
Proceedings of the ninth international conference on Information and knowledge management
Proceedings of the 10th international conference on World Wide Web
Hypermedia Track of the 10th International World Wide Web Conference
Towards a highly-scalable and effective metasearch engine
Proceedings of the 10th international conference on World Wide Web
Efficient and effective metasearch for text databases incorporating linkages among documents
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
SDLIP + STARTS = SDARTS a protocol and toolkit for metasearching
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Query-based sampling of text databases
ACM Transactions on Information Systems (TOIS)
A scalable content-addressable network
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Document language models, query models, and risk minimization for information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Effective site finding using link anchor information
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A highly scalable and effective method for metasearch
ACM Transactions on Information Systems (TOIS)
Merging techniques for performing data fusion on the web
Proceedings of the tenth international conference on Information and knowledge management
The effectiveness of query expansion for distributed information retrieval
Proceedings of the tenth international conference on Information and knowledge management
Approaches to collection selection and results merging for distributed information retrieval
Proceedings of the tenth international conference on Information and knowledge management
Exploiting a controlled vocabulary to improve collection selection and retrieval effectiveness
Proceedings of the tenth international conference on Information and knowledge management
Building efficient and effective metasearch engines
ACM Computing Surveys (CSUR)
Expert agreement and content based reranking in a meta search environment using Mearf
Proceedings of the 11th international conference on World Wide Web
Modern Information Retrieval
25th ACM/SIGIR International Conference on Research and Development in Information Retrieval
Using sampled data and regression to merge search engine results
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
A logistic regression approach to distributed IR
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Experiments on data fusion using headline information
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Pruning long documents for distributed information retrieval
Proceedings of the eleventh international conference on Information and knowledge management
A language modeling framework for resource selection and results merging
Proceedings of the eleventh international conference on Information and knowledge management
Discovering the representative of a search engine
Proceedings of the eleventh international conference on Information and knowledge management
Fusion Via a Linear Combination of Scores
Information Retrieval
Metrics for evaluating database selection techniques
World Wide Web
WISE: A World Wide Web Resource Database System
IEEE Transactions on Knowledge and Data Engineering
A Methodology to Retrieve Text Documents from Multiple Databases
IEEE Transactions on Knowledge and Data Engineering
QProber: A system for automatic classification of hidden-Web databases
ACM Transactions on Information Systems (TOIS)
ACM Transactions on Information Systems (TOIS)
Chord: a scalable peer-to-peer lookup protocol for internet applications
IEEE/ACM Transactions on Networking (TON)
Proceedings of the 27th International Conference on Very Large Data Bases
27th International Conference on Very Large Data Bases
Improving Text Classification by Shrinkage in a Hierarchy of Classes
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Precision and Recall of GlOSS Estimators for Database Discovery
PDIS '94 Proceedings of the Third International Conference on Parallel and Distributed Information Systems
Proceedings of the 27th International Conference on Very Large Data Bases
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
Proceedings of the 27th International Conference on Very Large Data Bases
Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Server Ranking for Distributed Text Retrieval Systems on the Internet
Proceedings of the Fifth International Conference on Database Systems for Advanced Applications (DASFAA)
A Comparison of Techniques for Selecting Text Collections
ADC '00 Proceedings of the Australasian Database Conference
Automated discovery of search interfaces on the web
ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Obtaining Language Models of Web Collections Using Query-Based Sampling Techniques
HICSS '02 Proceedings of the 35th Annual Hawaii International Conference on System Sciences (HICSS'02)-Volume 3 - Volume 3
CDM: an approach to learning in text categorization
TAI '95 Proceedings of the Seventh International Conference on Tools with Artificial Intelligence
Methodologies for Distributed Information Retrieval
ICDCS '98 Proceedings of the The 18th International Conference on Distributed Computing Systems
The 26th ACM/SIGIR International Symposium on Information Retrieval
Evaluating different methods of estimating retrieval quality for resource selection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Relevant document distribution estimation method for resource selection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
MIND: resource selection and data fusion in multimedia distributed digital libraries
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Result merging strategies for a current news metasearcher
Information Processing and Management: an International Journal
A Comparison of Two Methods for Boolean Query Relevance Feedback
A Comparison of Two Methods for Boolean Query Relevance Feedback
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A Methodology for Collection Selection in Heterogeneous Contexts
ITCC '02 Proceedings of the International Conference on Information Technology: Coding and Computing
Automatic Information Discovery from the "Invisible Web"
ITCC '02 Proceedings of the International Conference on Information Technology: Coding and Computing
A Meta-Search Method Reinforced by Cluster Descriptors
WISE '01 Proceedings of the Second International Conference on Web Information Systems Engineering (WISE'01) Volume 1 - Volume 1
Determining Stopping Criteria in the Generation of Web-Derived Langua ge Models
Determining Stopping Criteria in the Generation of Web-Derived Langua ge Models
Adaptive combination of evidence for information retrieval
Adaptive combination of evidence for information retrieval
Database selection in distributed information retrieval: a study of multi-collection information retrieval
Comparing the performance of collection selection algorithms
ACM Transactions on Information Systems (TOIS)
A semisupervised learning method to merge search engine results
ACM Transactions on Information Systems (TOIS)
The Journal of Machine Learning Research
On the Evolution of Clusters of Near-Duplicate Web Pages
LA-WEB '03 Proceedings of the First Conference on Latin American Web Congress
Web metasearch: rank vs. score based rank aggregation methods
Proceedings of the 2003 ACM symposium on Applied computing
Engineering a multi-purpose test collection for web retrieval experiments
Information Processing and Management: an International Journal
Mining data records in Web pages
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the twelfth international conference on Information and knowledge management
12th International Conference on Information and Knowledge Management
Content-based retrieval in hybrid peer-to-peer networks
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
A unified model for metasearch, pooling, and system evaluation
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Intelligent metasearch engine for knowledge management
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Effective page refresh policies for Web crawlers
ACM Transactions on Database Systems (TODS)
Distributed information retrieval: a multi-objective resource selection approach
International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems - Intelligent information systems
Shadow document methods of resutls merging
Proceedings of the 2004 ACM symposium on Applied computing
ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26
Distributed Multimedia Information Retrieval: Sigir 2003 Workshop on Distributed Information Retrieval, Toronto, Canada, August 2003: Revised, Selected, and Invited Papers (Lecture Notes in Computer Science, 2924)
Collection selection for managed distributed document databases
Information Processing and Management: an International Journal
When one sample is not enough: improving text database selection using shrinkage
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Query-related data extraction of hidden web documents
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Unified utility maximization framework for resource selection
Proceedings of the thirteenth ACM international conference on Information and knowledge management
A two-phase sampling technique for information extraction from hidden web databases
Proceedings of the 6th annual ACM international workshop on Web information and data management
Classifying and searching hidden-web text databases
Classifying and searching hidden-web text databases
Modeling and Managing Content Changes in Text Databases
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Proceedings of the 14th international conference on World Wide Web
WWW '05 Proceedings of the 14th international conference on World Wide Web
Fully automatic wrapper generation for search engines
WWW '05 Proceedings of the 14th international conference on World Wide Web
Sampling search-engine results
WWW '05 Proceedings of the 14th international conference on World Wide Web
The indexable web is more than 11.5 billion pages
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Improving text collection selection with coverage and overlap statistics
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
The TREC terabyte retrieval track
ACM SIGIR Forum
Server selection methods in hybrid portal search
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Modeling search engine effectiveness for federated search
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Ontology-Based Resource Descriptions for Distributed Information Sources
ICITA '05 Proceedings of the Third International Conference on Information Technology and Applications (ICITA'05) Volume 2 - Volume 02
Information source selection for resource constrained environments
ACM SIGMOD Record
The FedLemur project: Federated search in the real world
Journal of the American Society for Information Science and Technology
Two-stage statistical language models for text database selection
Information Retrieval
Reducing storage costs for federated search of text databases
dg.o '03 Proceedings of the 2003 annual national conference on Digital government research
Random sampling from a search engine's index
Proceedings of the 15th international conference on World Wide Web
An evaluation of resource description quality measures
Proceedings of the 2006 ACM symposium on Applied computing
Performance prediction of data fusion for information retrieval
Information Processing and Management: an International Journal
Towards better measures: evaluation of estimated resource description quality for distributed IR
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
SIGIR '06 The 29th Annual International SIGIR Conference
ProbFuse: a probabilistic approach to data fusion
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Improving the estimation of relevance models using large external corpora
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Capturing collection size for distributed non-cooperative retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
User modeling for full-text federated search in peer-to-peer networks
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Distributed query sampling: a quality-conscious approach
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
A study of results overlap and uniqueness among major web search engines
Information Processing and Management: an International Journal
A Survey of Web Information Extraction Systems
IEEE Transactions on Knowledge and Data Engineering
Automatic extraction of dynamic record sections from search engine result pages
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Estimating corpus size via queries
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
String Processing and Information Retrieval: 13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006, Proceedings (Lecture Notes in Computer Science)
Proceedings of the 16th international conference on World Wide Web
16th International World Wide Web Conference
Proceedings of the 16th international conference on World Wide Web
16th International World Wide Web Conference
Efficient search engine measurements
Proceedings of the 16th international conference on World Wide Web
AllInOneNews: development and evaluation of a large-scale news metasearch engine
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Finding similar files in a large file system
WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Distributed text retrieval from overlapping collections
ADC '07 Proceedings of the eighteenth conference on Australasian database - Volume 63
Using query logs to establish vocabularies in distributed information retrieval
Information Processing and Management: an International Journal
The 30th Annual International SIGIR Conference
Latent concept expansion using markov random fields
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Federated text retrieval from uncooperative overlapped collections
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating sampling methods for uncooperative collections
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Updating collection representations for federated search
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Estimating collection size with logistic regression
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Mining templates from search result records of search engines
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Distributed search over the hidden web: hierarchical database sampling and selection
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Computing pagerank in a distributed internet search system
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Classification-aware hidden-web text database selection
ACM Transactions on Information Systems (TOIS)
Full-text federated search in peer-to-peer networks
Full-text federated search in peer-to-peer networks
Retrieval and feedback models for blog feed search
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Towards personalized distributed information retrieval
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Generalising multiple capture-recapture to non-uniform sample sizes
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Experiences evaluating personal metasearch
Proceedings of the second international symposium on Information interaction in context
Proceedings of the VLDB Endowment
Proceedings of the 17th ACM conference on Information and knowledge management
Conference on Information and Knowledge Management
Blog site search using resource selection
Proceedings of the 17th ACM conference on Information and knowledge management
Efficient estimation of the size of text deep web data source
Proceedings of the 17th ACM conference on Information and knowledge management
ACM SIGIR Forum
An Approach to Deep Web Crawling by Sampling
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Integration of news content into web results
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Robust result merging using sample-based score estimates
ACM Transactions on Information Systems (TOIS)
A Topic-Based Measure of Resource Description Quality for Distributed Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
The 32nd International ACM SIGIR conference on research and development in Information Retrieval
Sources of evidence for vertical selection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Adaptation of offline vertical selection predictions in the presence of user feedback
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Click-through prediction for news queries
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
SUSHI: scoring scaled samples for server selection
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Effective query expansion for federated search
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
The 32nd International ACM SIGIR conference on research and development in Information Retrieval
Server selection methods in personal metasearch: a comparative empirical study
Information Retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
Conference on Information and Knowledge Management
Classification-based resource selection
Proceedings of the 18th ACM conference on Information and knowledge management
Learning from past queries for resource selection
Proceedings of the 18th ACM conference on Information and knowledge management
ViDE: A Vision-Based Approach for Deep Web Data Extraction
IEEE Transactions on Knowledge and Data Engineering
Estimating deep web data source size by capture---recapture method
Information Retrieval
Central-rank-based collection selection in uncooperative distributed information retrieval
ECIR'07 Proceedings of the 29th European conference on IR research
Segmentation of search engine results for effective data-fusion
ECIR'07 Proceedings of the 29th European conference on IR research
Extending probabilistic data fusion using sliding windows
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Advanced Metasearch Engine Technology
Advanced Metasearch Engine Technology
The 33rd International ACM SIGIR conference on research and development in Information Retrieval
Ranking using multiple document types in desktop search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
A joint probabilistic classification model for resource selection
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Learning trees and rules with set-valued features
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1
CLEF 2005: multilingual retrieval by combining multiple multilingual ranked lists
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Compact features for detection of near-duplicates in distributed retrieval
SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
Adaptive query-based sampling of distributed collections
SPIRE'06 Proceedings of the 13th international conference on String Processing and Information Retrieval
Evaluation of result merging strategies for metasearch engines
WISE'05 Proceedings of the 6th international conference on Web Information Systems Engineering
Sample sizes for query probing in uncooperative distributed information retrieval
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Federated search of text-based digital libraries in hierarchical peer-to-peer networks
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Evaluating large-scale distributed vertical search
Proceedings of the 9th workshop on Large-scale and distributed informational retrieval
To what problem is distributed information retrieval the solution?
Journal of the American Society for Information Science and Technology
Foundations and Trends in Information Retrieval
Federated search in the wild: the combined power of over a hundred search engines
Proceedings of the 21st ACM international conference on Information and knowledge management
Ranking distributed knowledge repositories
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Collection ranking and selection for federated entity search
SPIRE'12 Proceedings of the 19th international conference on String Processing and Information Retrieval
Studying the clustering paradox and scalability of search in highly distributed environments
ACM Transactions on Information Systems (TOIS)
Reducing the uncertainty in resource selection
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Snippet-Based relevance predictions for federated web search
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Distributed information retrieval and applications
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Vertical selection in the information domain of children
Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Automatic generation of textual image collection descriptions
Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
Information Systems
Taily: shard selection using the tail of score distributions
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Aggregated search interface preferences in multi-session search tasks
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Which vertical search engines are relevant?
Proceedings of the 22nd international conference on World Wide Web
Exploiting Forum Thread Structures to Improve Thread Clustering
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Merging algorithms for enterprise search
Proceedings of the 18th Australasian Document Computing Symposium
Composite retrieval of heterogeneous web search
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
Federated search (federated information retrieval or distributed information retrieval) is a technique for searching multiple text collections simultaneously. Queries are submitted to a subset of collections that are most likely to return relevant answers. The results returned by selected collections are integrated and merged into a single list. Federated search is preferred over centralized search alternatives in many environments. For example, commercial search engines such as Google cannot easily index uncrawlable hidden web collections while federated search systems can search the contents of hidden web collections without crawling. In enterprise environments, where each organization maintains an independent search engine, federated search techniques can provide parallel search over multiple collections. There are three major challenges in federated search. For each query, a subset of collections that are most likely to return relevant documents are selected. This creates the collection selection problem. To be able to select suitable collections, federated search systems need to acquire some knowledge about the contents of each collection, creating the collection representation problem. The results returned from the selected collections are merged before the final presentation to the user. This final step is the result merging problem. The goal of this work, is to provide a comprehensive summary of the previous research on the federated search challenges described above.