A versatile tool for privacy-enhanced web search

Authors:
Avi Arampatzis;George Drosatos;Pavlos S. Efraimidis
Affiliations:
Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece;Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece;Department of Electrical and Computer Engineering, Democritus University of Thrace, Xanthi, Greece
Venue:
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Year:
2013

Citing 8
Cited 0

Class-based n-gram models of natural language

Computational Linguistics
Query-based sampling of text databases

ACM Transactions on Information Systems (TOIS)
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
k-anonymity: a model for protecting privacy

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Frequency estimates for statistical word similarity measures

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A picture of search

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Introduction to Information Retrieval

Introduction to Information Retrieval
Enhancing deniability against query-logs

ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

We consider the problem of privacy leaks suffered by Internet users when they perform web searches, and propose a framework to mitigate them. Our approach, which builds upon and improves recent work on search privacy, approximates the target search results by replacing the private user query with a set of blurred or scrambled queries. The results of the scrambled queries are then used to cover the original user interest. We model the problem theoretically, define a set of privacy objectives with respect to web search and investigate the effectiveness of the proposed solution with a set of real queries on a large web collection. Experiments show great improvements in retrieval effectiveness over a previously reported baseline in the literature. Furthermore, the methods are more versatile, predictably-behaved, applicable to a wider range of information needs, and the privacy they provide is more comprehensible to the end-user.