Effective anonymization of query logs

  • Authors:
  • Yuan Hong;Xiaoyun He;Jaideep Vaidya;Nabil Adam;Vijayalakshmi Atluri

  • Affiliations:
  • Rutgers University, Newark, NJ, USA;Rutgers University, Newark, NJ, USA;Rutgers University, Newark, NJ, USA;Rutgers University, Newark, NJ, USA;Rutgers University, Newark, NJ, USA

  • Venue:
  • Proceedings of the 18th ACM conference on Information and knowledge management
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

User search query logs have proven to be very useful, but have vast potential for misuse. Several incidents have shown that simple removal of identifiers is insufficient to protect the identity of users. Publishing such inadequately anonymized data can cause severe breach of privacy. While significant effort has been expended on coming up with anonymity models and techniques for microdata, there is little corresponding work for query log data. Query logs are different in several important aspects, such as the diversity of queries and the causes of privacy breach. This necessitates the need to design privacy models and techniques specific to this environment. This paper takes a first cut at tackling this challenge. Our main contribution is to define effective anonymization models for query log data along with proposing techniques to achieve such anonymization. We analyze the inherent utility and privacy tradeoff, and experimentally validate the performance of our techniques.