Privacy preserving clustering

Authors:
Somesh Jha;Luis Kruger;Patrick McDaniel
Affiliations:
Computer Sciences Department, University of Wisconsin, Madison, WI;Computer Sciences Department, University of Wisconsin, Madison, WI;Computer Science and Engineering, Pennsylvania State University, University Park, PA
Venue:
ESORICS'05 Proceedings of the 10th European conference on Research in Computer Security
Year:
2005



Abstract

The freedom and transparency of information flow on the Internet have heightened privacy concerns. Given a set of data items, clustering algorithms group similar items together. Clustering has many applications, such as customer-behavior analysis, targeted marketing, forensics, and bioinformatics. In this paper, we present the design and analysis of a privacy-preserving k-means clustering algorithm in which only the cluster means at the various steps of the algorithm are revealed to the participating parties. The crucial step in our privacy-preserving k-means is the privacy-preserving computation of cluster means. We present two protocols for this computation, one based on oblivious polynomial evaluation and the other on homomorphic encryption. We have a Java implementation of our algorithm. Using this implementation, we have performed a thorough evaluation of our privacy-preserving clustering algorithm on three data sets. Our evaluation demonstrates that privacy-preserving clustering is feasible: for example, our homomorphic-encryption-based algorithm finished clustering a large data set in approximately 66 seconds.
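To make the homomorphic-encryption idea concrete, the following is a minimal Python sketch of how two parties might jointly compute one cluster mean from their local sums and counts without exchanging those values in the clear. It is an illustration under stated assumptions, not the protocol from the paper: it assumes textbook Paillier as the additively homomorphic scheme, uses toy key sizes that offer no real security, handles a single dimension, and all function names (`keygen`, `encrypt`, `decrypt`, `add`, `scale`, `joint_mean`) are hypothetical.

```python
import math
import secrets
from fractions import Fraction

# Toy primes (two Mersenne primes). Assumption for illustration only:
# real deployments need primes of 1024 bits or more.
P = 2**31 - 1
Q = 2**61 - 1


def keygen(p=P, q=Q):
    """Return (public, private) keys for textbook Paillier with g = n + 1."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p - 1, q - 1)
    mu = pow(lam, -1, n)  # modular inverse; needs Python 3.8+
    return (n, n + 1), (lam, mu)


def encrypt(pub, m):
    """Encrypt a non-negative integer m < n."""
    n, g = pub
    n2 = n * n
    while True:
        r = secrets.randbelow(n - 1) + 1
        if math.gcd(r, n) == 1:
            break
    return (pow(g, m, n2) * pow(r, n, n2)) % n2


def decrypt(pub, priv, c):
    n, _ = pub
    lam, mu = priv
    u = pow(c, lam, n * n)
    return ((u - 1) // n) * mu % n


def add(pub, c1, c2):
    """Ciphertext of m1 + m2, computed without decrypting."""
    n, _ = pub
    return (c1 * c2) % (n * n)


def scale(pub, c, k):
    """Ciphertext of k * m, computed without decrypting."""
    n, _ = pub
    return pow(c, k, n * n)


def joint_mean(sum1, count1, sum2, count2):
    """Simulate both parties in one process and return the joint mean."""
    pub, priv = keygen()  # party 1 owns the key pair
    # Party 1 sends encryptions of its local sum and count.
    enc_sum, enc_count = encrypt(pub, sum1), encrypt(pub, count1)
    # Party 2 blinds numerator and denominator with the same random r.
    r = secrets.randbelow(2**20) + 1
    enc_num = add(pub, scale(pub, enc_sum, r), encrypt(pub, r * sum2))
    enc_den = add(pub, scale(pub, enc_count, r), encrypt(pub, r * count2))
    # Party 1 decrypts the blinded values; r cancels in the ratio.
    num = decrypt(pub, priv, enc_num)
    den = decrypt(pub, priv, enc_den)
    return Fraction(num, den)
```

For example, `joint_mean(12, 3, 30, 4)` combines a local sum of 12 over 3 points with a local sum of 30 over 4 points. The sketch assumes non-negative integer inputs and a non-zero total count; real-valued data would need fixed-point encoding. The multiplicative blinding here is only illustrative and is not a proof of privacy.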