Towards comprehensive privacy protection in data clustering

  • Authors:
  • Nan Zhang

  • Affiliations:
  • Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, TX

  • Venue:
  • PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We address the protection of private information in data clustering. Previous work focuses on protecting the privacy of data being mined. We find that the cluster labels of individual data points can also be sensitive to data owners. Thus, we propose a privacy-preserving data clustering scheme that extracts accurate clustering rules from private data while protecting the privacy of both original data and individual cluster labels. We derive theoretical bounds on the performance of our scheme, and evaluate it experimentally with real-world data.