Accurate and large-scale privacy-preserving data mining using the election paradigm

Authors:
Emmanouil Magkos;Manolis Maragoudakis;Vassilis Chrissikopoulos;Stefanos Gritzalis
Affiliations:
Department of Informatics, Ionian University, Plateia Tsirigoti 7, Corfu 49100, Greece;Department of Information and Communication Systems Engineering, University of the Aegean, Karlovassi, Samos, Greece;Department of Informatics, Ionian University, Plateia Tsirigoti 7, Corfu 49100, Greece;Department of Information and Communication Systems Engineering, University of the Aegean, Karlovassi, Samos, Greece
Venue:
Data & Knowledge Engineering
Year:
2009

Citing 42
Cited 7

A public key cryptosystem and a signature scheme based on discrete logarithms

Proceedings of CRYPTO 84 on Advances in cryptology
How to play ANY mental game

STOC '87 Proceedings of the nineteenth annual ACM symposium on Theory of computing
How to prove yourself: practical solutions to identification and signature problems

Proceedings on Advances in cryptology---CRYPTO '86
Security and inference in multilevel database and knowledge-base systems

SIGMOD '87 Proceedings of the 1987 ACM SIGMOD international conference on Management of data
Security-control methods for statistical databases: a comparative study

ACM Computing Surveys (CSUR)
Proofs that yield nothing but their validity or all languages in NP have zero-knowledge proof systems

Journal of the ACM (JACM)
Threshold cryptosystems

CRYPTO '89 Proceedings on Advances in cryptology
Receipt-free secret-ballot elections (extended abstract)

STOC '94 Proceedings of the twenty-sixth annual ACM symposium on Theory of computing
Bagging predictors

Machine Learning
Multi party computations: past and present

PODC '97 Proceedings of the sixteenth annual ACM symposium on Principles of distributed computing
Generalizing data to provide anonymity when disclosing information (abstract)

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Privacy-preserving data mining

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
On the design and quantification of privacy preserving data mining algorithms

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Practical multi-candidate election system

Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
Random Forests

Machine Learning
Data Mining: An Overview from a Database Perspective

IEEE Transactions on Knowledge and Data Engineering
A Practical Secret Voting Scheme for Large Scale Elections

ASIACRYPT '92 Proceedings of the Workshop on the Theory and Application of Cryptographic Techniques: Advances in Cryptology
The Rampart Toolkit for Building High-Integrity Services

Selected Papers from the International Workshop on Theory and Practice in Distributed Systems
Cryptographic techniques for privacy-preserving data mining

ACM SIGKDD Explorations Newsletter
Revealing information while preserving privacy

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Privacy preserving mining of association rules

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy preserving association rule mining in vertically partitioned data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Building decision tree classifier on private data

CRPIT '14 Proceedings of the IEEE international conference on Privacy, security and data mining - Volume 14
Multi-authority secret-ballot elections with linear work

Multi-authority secret-ballot elections with linear work
Verifiable secret-ballot elections

Verifiable secret-ballot elections
State-of-the-art in privacy preserving data mining

ACM SIGMOD Record
The role of cryptography in database security

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Privacy-preserving Bayesian network structure computation on distributed heterogeneous data

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy-Preserving Distributed Mining of Association Rules on Horizontally Partitioned Data

IEEE Transactions on Knowledge and Data Engineering
Private inference control

Proceedings of the 11th ACM conference on Computer and communications security
A new scheme on privacy-preserving data classification

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Privacy-preserving collaborative association rule mining

Journal of Network and Computer Applications
Maintaining data privacy in association rule mining

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Private Inference Control for Aggregate Database Queries

ICDMW '07 Proceedings of the Seventh IEEE International Conference on Data Mining Workshops
Privacy-preserving SVM classification

Knowledge and Information Systems
Transforming semi-honest protocols to ensure accountability

Data & Knowledge Engineering
How to generate and exchange secrets

SFCS '86 Proceedings of the 27th Annual Symposium on Foundations of Computer Science
Accuracy in Privacy-Preserving Data Mining Using the Paradigm of Cryptographic Elections

PSD '08 Proceedings of the UNESCO Chair in data privacy international conference on Privacy in Statistical Databases
Efficient receipt-free voting based on homomorphic encryption

EUROCRYPT'00 Proceedings of the 19th international conference on Theory and application of cryptographic techniques
Towards privacy-preserving model selection

PinKDD'07 Proceedings of the 1st ACM SIGKDD international conference on Privacy, security, and trust in KDD
Privacy-preserving collaborative data mining

IEEE Computational Intelligence Magazine
New directions in cryptography

IEEE Transactions on Information Theory

On the use of economic price theory to find the optimum levels of privacy and information utility in non-perturbative microdata anonymisation

Data & Knowledge Engineering
Privacy-preserving publishing microdata with full functional dependencies

Data & Knowledge Engineering
Privacy-aware collection of aggregate spatial data

Data & Knowledge Engineering
Privacy preserving tree augmented Naïve bayesian multi-party implementation on horizontally partitioned databases

TrustBus'11 Proceedings of the 8th international conference on Trust, privacy and security in digital business
Privacy-preserving back-propagation and extreme learning machine algorithms

Data & Knowledge Engineering
Private client-side profiling with random forests and hidden markov models

PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Cloud-enabled privacy-preserving collaborative learning for mobile sensing

Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

With the proliferation of the Web and ICT technologies there have been concerns about the handling and use of sensitive information by data mining systems. Recent research has focused on distributed environments where the participants in the system may also be mutually mistrustful. In this paper we discuss the design and security requirements for large-scale privacy-preserving data mining (PPDM) systems in a fully distributed setting, where each client possesses its own records of private data. To this end we argue in favor of using some well-known cryptographic primitives, borrowed from the literature on Internet elections. More specifically, our framework is based on the classical homomorphic election model, and particularly on an extension for supporting multi-candidate elections. We also review a recent scheme [Z. Yang, S. Zhong, R.N. Wright, Privacy-preserving classification of customer data without loss of accuracy, in: SDM' 2005 SIAM International Conference on Data Mining, 2005] which was the first scheme that used the homomorphic encryption primitive for PPDM in the fully distributed setting. Finally, we show how our approach can be used as a building block to obtain Random Forests classification with enhanced prediction performance.