PISA: A framework for integrating uncooperative peers into P2P-based federated search

Authors:
Gang Chen;Zujie Ren;Lidan Shou;Ke Chen;Yijun Bei
Affiliations:
College of Computer Science and Technology, Zhejiang University, Hangzhou, China;College of Computer Science and Technology, Zhejiang University, Hangzhou, China;College of Computer Science and Technology, Zhejiang University, Hangzhou, China;College of Computer Science and Technology, Zhejiang University, Hangzhou, China;College of Computer Science and Technology, Zhejiang University, Hangzhou, China
Venue:
Computer Communications
Year:
2011

Citing 22
Cited 0

GlOSS: text-source discovery over the Internet

ACM Transactions on Database Systems (TODS)
Database merging strategy based on logistic regression

Information Processing and Management: an International Journal
An information-theoretic approach to automatic query expansion

ACM Transactions on Information Systems (TOIS)
Query-based sampling of text databases

ACM Transactions on Information Systems (TOIS)
Chord: A scalable peer-to-peer lookup service for internet applications

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Modern Information Retrieval

Modern Information Retrieval
Text-Based Content Search and Retrieval in Ad-hoc P2P Communities

Revised Papers from the NETWORKING 2002 Workshops on Web Engineering and Peer-to-Peer Computing
Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks

WWW '03 Proceedings of the 12th international conference on World Wide Web
A semisupervised learning method to merge search engine results

ACM Transactions on Information Systems (TOIS)
When one sample is not enough: improving text database selection using shrinkage

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
The robustness of content-based search in hierarchical peer to peer networks

Proceedings of the thirteenth ACM international conference on Information and knowledge management
MINERVA: collaborative P2P search

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Towards better measures: evaluation of estimated resource description quality for distributed IR

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Understanding churn in peer-to-peer networks

Proceedings of the 6th ACM SIGCOMM conference on Internet measurement
Search and browse services for heterogeneous collections with the peer-to-peer network Pepper

Information Processing and Management: an International Journal
Survey of research towards robust peer-to-peer networks: search methods

Computer Networks: The International Journal of Computer and Telecommunications Networking
Modeling and managing changes in text databases

ACM Transactions on Database Systems (TODS)
Updating collection representations for federated search

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Scalable blind search and broadcasting over Distributed Hash Tables

Computer Communications
Full-text federated search in peer-to-peer networks

Full-text federated search in peer-to-peer networks
HyperCuP: hypercubes, ontologies, and efficient search on peer-to-peer networks

AP2PC'02 Proceedings of the 1st international conference on Agents and peer-to-peer computing
Federated search of text-based digital libraries in hierarchical peer-to-peer networks

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research

Quantified Score

Hi-index	0.24

Visualization

Abstract

In the past years, federated search over peer-to-peer (P2P) networks has attracted considerable attention from the information retrieval community. Most previous work assumes a so-called cooperative environment, where each peer can actively participate in information publishing and document indexing in a distributed manner. In contrast, little prior work has addressed the problem of incorporating uncooperative peers, which do not publish their own corpus statistics over a network. In this paper, we present a P2P-based federated search framework named PISA, which incorporates uncooperative peers when providing search service. Specifically, we (i) propose a heuristic query-based sampling approach named HQBS, which can obtain high-quality resource descriptions from uncooperative peers at a low communication cost; (ii) present two result merging methods, called RISE and RISE+, to merge the results returned by uncooperative peers; (iii) develop a method called Controlled and Selective Update (CSU) to efficiently maintain the index directory of PISA. Our extensive experiments on the TREC WT10g data set demonstrate that PISA can provide high-quality search results and integrate uncooperative peers at low cost.