Privacy Preserving Data Mining

Authors:
Yehuda Lindell; Benny Pinkas
Venue:
CRYPTO '00 Proceedings of the 20th Annual International Cryptology Conference on Advances in Cryptology
Year:
2000

Citing 15
Cited 99


Quantified Score

Hi-index	0.02

Abstract

In this paper we introduce the concept of privacy preserving data mining. In our model, two parties owning confidential databases wish to run a data mining algorithm on the union of their databases, without revealing any unnecessary information. This problem has many practical and important applications, such as in medical research with confidential patient records. Data mining algorithms are usually complex, especially as the size of the input is measured in megabytes, if not gigabytes. A generic secure multi-party computation solution, based on evaluation of a circuit computing the algorithm on the entire input, is therefore of no practical use. We focus on the problem of decision tree learning and use ID3, a popular and widely used algorithm for this problem. We present a solution that is considerably more efficient than generic solutions. It demands very few rounds of communication and reasonable bandwidth. In our solution, each party performs by itself a computation of the same order as computing the ID3 algorithm for its own database. The results are then combined using efficient cryptographic protocols, whose overhead is only logarithmic in the number of transactions in the databases. We feel that our result is a substantial contribution, demonstrating that secure multi-party computation can be made practical, even for complex problems and large inputs.
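The abstract's split between local work and a combining step can be illustrated with a small sketch. It assumes the two databases share the same attributes and differ only in which records they hold, and it uses the standard ID3 information-gain formula. All names and the toy records below are hypothetical. The counts are added in the clear here, so this is not the paper's cryptographic protocol; it only shows that the gain for an attribute depends on the two databases solely through sums of per-party counts, which is the quantity a secure protocol would have to combine without revealing either party's share.

```python
from collections import Counter
from math import log2


def local_counts(rows, attr, label):
    """Count (attribute value, class) pairs in one party's own rows."""
    return Counter((row[attr], row[label]) for row in rows)


def entropy(class_counts):
    """Shannon entropy (in bits) of a class-count table."""
    total = sum(class_counts.values())
    return -sum(n / total * log2(n / total) for n in class_counts.values() if n)


def info_gain(joint):
    """ID3 information gain computed from joint (value, class) counts."""
    total = sum(joint.values())
    by_class = Counter()
    by_value = {}
    for (value, cls), n in joint.items():
        by_class[cls] += n
        by_value.setdefault(value, Counter())[cls] += n
    # Weighted entropy of the class after splitting on the attribute.
    remainder = sum(
        sum(counts.values()) / total * entropy(counts)
        for counts in by_value.values()
    )
    return entropy(by_class) - remainder


# Hypothetical toy records held by the two parties.
party_a = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "rain", "play": "yes"},
]
party_b = [
    {"outlook": "sunny", "play": "no"},
    {"outlook": "rain", "play": "yes"},
    {"outlook": "rain", "play": "no"},
]

# Each party counts over its own records only.
counts_a = local_counts(party_a, "outlook", "play")
counts_b = local_counts(party_b, "outlook", "play")

# Combining step, done in the clear for illustration only.
gain_from_counts = info_gain(counts_a + counts_b)

# Reference value: gain computed directly on the union of the records.
gain_from_union = info_gain(local_counts(party_a + party_b, "outlook", "play"))

print(abs(gain_from_counts - gain_from_union) < 1e-12)
```

In this sketch each party's pass over its own data costs about as much as a local ID3 counting step, which matches the abstract's description of the per-party workload. The addition of the two count tables stands in for the part that the paper handles with cryptographic protocols.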