Shadow document methods of resutls merging

Authors:
Shengli Wu;Fabio Crestani
Affiliations:
University of Strathclyde, Glasgow, UK;University of Strathclyde, Glasgow, UK
Venue:
Proceedings of the 2004 ACM symposium on Applied computing
Year:
2004

Citing 13
Cited 4

Searching distributed collections with inference networks

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Learning collection fusion strategies

SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Experiences with selecting search engines using metasearch

ACM Transactions on Information Systems (TOIS)
Analyses of multiple evidence combination

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Efficient and effective metasearch for a large number of text databases

Proceedings of the eighth international conference on Information and knowledge management
Server selection on the World Wide Web

DL '00 Proceedings of the fifth ACM conference on Digital libraries
Database merging strategy based on logistic regression

Information Processing and Management: an International Journal
Building efficient and effective metasearch engines

ACM Computing Surveys (CSUR)
Expert agreement and content based reranking in a meta search environment using Mearf

Proceedings of the 11th international conference on World Wide Web
Using sampled data and regression to merge search engine results

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Fusion Via a Linear Combination of Scores

Information Retrieval
Context and Page Analysis for Improved Web Search

IEEE Internet Computing
Improving the evaluation of web search systems

ECIR'03 Proceedings of the 25th European conference on IR research

Result merging methods in distributed information retrieval with overlapping databases

Information Retrieval
Probability-based fusion of information retrieval result sets

Artificial Intelligence Review
A results merging algorithm for distributed information retrieval environments that combines regression methodologies with a selective download phase

Information Processing and Management: an International Journal
Federated Search

Foundations and Trends in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

In distributed information retrieval systems, document overlaps occur frequently across results from different databases. This is especially the case for meta-search engines which merge results from several general-purpose web search engines. This paper addresses the problem of merging results which contain overlaps in order to achieve better performance. Several algorithms for merging results are proposed, which take advantage of the use of duplicate documents in two ways: one correlates scores from different results; the other regards duplicates as increasing evidence of being relevant to the given query. A variety of experiments have demonstrated that these methods are effective.