Privacy-preserving clustering with distributed EM mixture modeling

  • Authors:
  • Xiaodong Lin;Chris Clifton;Michael Zhu

  • Affiliations:
  • Department of Mathematical Sciences, University of Cincinnati, 45221-0025, Cincinnati, OH, USA;Department of Computer Science, Purdue University, 45221-0025, West Lafayette, IN, USA;Department of Statistics, Purdue University, 45221-0025, West Lafayette, IN, USA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Privacy and security considerations can prevent sharing of data, derailing data mining projects. Distributed knowledge discovery can alleviate this problem. We present a technique that uses EM mixture modeling to perform clustering on distributed data. This method controls data sharing, preventing disclosure of individual data items or any results that can be traced to an individual site.