In recent years, federated search over peer-to-peer (P2P) networks has attracted considerable attention from the information retrieval community. Most previous work assumes a cooperative environment, in which every peer actively participates in publishing information and indexing documents in a distributed manner. Little prior work has addressed how to incorporate uncooperative peers, which do not publish their own corpus statistics to the network. In this paper, we present PISA, a P2P-based federated search framework that incorporates uncooperative peers when providing search service. Specifically, we (i) propose a heuristic query-based sampling approach, HQBS, which obtains high-quality resource descriptions from uncooperative peers at low communication cost; (ii) present two result merging methods, RISE and RISE+, for merging the results returned by uncooperative peers; and (iii) develop a method called Controlled and Selective Update (CSU) to efficiently maintain the index directory of PISA. Our extensive experiments on the TREC WT10g data set demonstrate that PISA provides high-quality search results and integrates uncooperative peers at low cost.
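As background for the sampling step, the following is a minimal sketch of plain query-based sampling, the general technique of building a resource description for a peer that exposes only a search interface. It is not HQBS, whose heuristics are not described here. The names `query_based_sample`, `make_toy_peer`, `max_docs`, and `top_k` are hypothetical, and the toy peer stands in for a real uncooperative peer.

```python
import random
from collections import Counter


def query_based_sample(search, seed_term, max_docs=10, top_k=2, rng=None):
    """Plain query-based sampling (assumed baseline, not the paper's HQBS).

    `search` is any callable mapping a one-term query to a ranked list of
    (doc_id, text) pairs. Returns the sampled document frequencies and the
    number of distinct documents sampled.
    """
    rng = rng or random.Random(0)
    sampled = {}         # doc_id -> text of documents seen so far
    term_df = Counter()  # term -> number of sampled documents containing it
    tried = set()        # query terms already sent to the peer
    query = seed_term
    while query is not None and len(sampled) < max_docs:
        tried.add(query)
        for doc_id, text in search(query)[:top_k]:
            if doc_id not in sampled:
                sampled[doc_id] = text
                term_df.update(set(text.lower().split()))
        # Pick the next probe from vocabulary learned so far.
        candidates = sorted(t for t in term_df if t not in tried)
        query = rng.choice(candidates) if candidates else None
    return term_df, len(sampled)


def make_toy_peer(docs):
    """Hypothetical stand-in for an uncooperative peer: search only, no statistics."""
    def search(term):
        return [(i, d) for i, d in enumerate(docs) if term in d.lower().split()]
    return search


docs = [
    "peer networks route queries",
    "federated search merges results",
    "queries reach uncooperative peer",
]
description, n = query_based_sample(make_toy_peer(docs), "peer")
```

In this toy run the second document shares no vocabulary with the others, so probing from the seed term never reaches it. Gaps of this kind are one reason the choice of probe queries matters for the quality of the resulting resource description.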