Maintaining data privacy in association rule mining

  • Authors:
  • Shariq J. Rizvi;Jayant R. Haritsa

  • Affiliations:
  • Computer Science & Engineering, Indian Institute of Technology, Mumbai, India;Database Systems Lab., SERC, Indian Institute of Science, Bangalore, India

  • Venue:
  • VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Data mining services require accurate input data for their results to be meaningful, but privacy concerns may influence users to provide spurious information. We investigate here, with respect to mining association rules, whether users can be encouraged to provide correct information by ensuring that the mining process cannot, with any reasonable degree of certainty, violate their privacy. We present a scheme, based on probabilistic distortion of user data, that can simultaneously provide a high degree of privacy to the user and retain a high level of accuracy in the mining results. The performance of the scheme is validated against representative real and synthetic datasets.