Practical privacy: the SuLQ framework

Authors:
Avrim Blum;Cynthia Dwork;Frank McSherry;Kobbi Nissim
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Microsoft Research, SVC, La Avenida, Mountain View, CA;Microsoft Research, SVC, La Avenida, Mountain View, CA;Ben-Gurion University, Israel
Venue:
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Year:
2005

Citing 12
Cited 125

Security-control methods for statistical databases: a comparative study

ACM Computing Surveys (CSUR)
Efficient noise-tolerant learning from statistical queries

Journal of the ACM (JACM)
Secure statistical databases with random sample queries

ACM Transactions on Database Systems (TODS)
Privacy-preserving data mining

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
On the design and quantification of privacy preserving data mining algorithms

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Private approximation of NP-hard functions

STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Induction of Decision Trees

Machine Learning
Randomizing, A Practical Method for Protecting Statistical Databases Against Compromise

VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
Revealing information while preserving privacy

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Limiting privacy breaches in privacy preserving data mining

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Information Theory, Inference & Learning Algorithms

Information Theory, Inference & Learning Algorithms
Toward privacy in public databases

TCC'05 Proceedings of the Second international conference on Theory of Cryptography

Privacy via pseudorandom sketches

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A privacy-preserving technique for Euclidean distance-based mining algorithms using Fourier-related transforms

The VLDB Journal — The International Journal on Very Large Data Bases
Towards robustness in query auditing

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
L-diversity: Privacy beyond k-anonymity

ACM Transactions on Knowledge Discovery from Data (TKDD)
Wherefore art thou r3579x?: anonymized social networks, hidden patterns, and structural steganography

Proceedings of the 16th international conference on World Wide Web
Smooth sensitivity and sampling in private data analysis

Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
The price of privacy and the limits of LP decoding

Proceedings of the thirty-ninth annual ACM symposium on Theory of computing
Privacy, accuracy, and consistency too: a holistic solution to contingency table release

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Large-scale collection and sanitization of network security data: risks and challenges

NSPW '06 Proceedings of the 2006 workshop on New security paradigms
Challenges in mining social network data: processes, privacy, and paradoxes

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Secure two-party k-means clustering

Proceedings of the 14th ACM conference on Computer and communications security
Privacy-preserving remote diagnostics

Proceedings of the 14th ACM conference on Computer and communications security
Time series compressibility and privacy

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
The boundary between privacy and utility in data publishing

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
On static and dynamic methods for condensation-based privacy-preserving data mining

ACM Transactions on Database Systems (TODS)
A learning theory approach to non-interactive database privacy

STOC '08 Proceedings of the fortieth annual ACM symposium on Theory of computing
Dynamic anonymization: accurate statistical analysis with privacy preservation

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Preservation of proximity privacy in publishing numerical sensitive data

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Epistemic privacy

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Providing k-anonymity in data mining

The VLDB Journal — The International Journal on Very Large Data Bases
Workload-aware anonymization techniques for large-scale datasets

ACM Transactions on Database Systems (TODS)
The cost of privacy: destruction of data-mining utility in anonymized data publishing

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Composition attacks and auxiliary information in data privacy

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning about the world through long-term query logs

ACM Transactions on the Web (TWEB)
Distributed Private Data Analysis: Simultaneously Solving How and What

CRYPTO 2008 Proceedings of the 28th Annual conference on Cryptology: Advances in Cryptology
New Efficient Attacks on Statistical Disclosure Control Mechanisms

CRYPTO 2008 Proceedings of the 28th Annual conference on Cryptology: Advances in Cryptology
Simulatable Binding: Beyond Simulatable Auditing

SDM '08 Proceedings of the 5th VLDB workshop on Secure Data Management
Output perturbation with query relaxation

Proceedings of the VLDB Endowment
The Differential Privacy Frontier (Extended Abstract)

TCC '09 Proceedings of the 6th Theory of Cryptography Conference on Theory of Cryptography
Universally utility-maximizing privacy mechanisms

Proceedings of the forty-first annual ACM symposium on Theory of computing
Private coresets

Proceedings of the forty-first annual ACM symposium on Theory of computing
Differential privacy and robust statistics

Proceedings of the forty-first annual ACM symposium on Theory of computing
Differentially private recommender systems: building privacy into the net

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy integrated queries: an extensible platform for privacy-preserving data analysis

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Privacy-Preserving Query Answering in Logic-based Information Systems

Proceedings of the 2008 conference on ECAI 2008: 18th European Conference on Artificial Intelligence
A Bayesian model for disclosure control in statistical databases

Data & Knowledge Engineering
Privacy-Preserving Data Publishing

Foundations and Trends in Databases
Privacy without noise

Proceedings of the 18th ACM conference on Information and knowledge management
Learning your identity and disease from research papers: information leaks in genome wide association study

Proceedings of the 16th ACM conference on Computer and communications security
Asymptotically Optimal and Private Statistical Estimation

CANS '09 Proceedings of the 8th International Conference on Cryptology and Network Security
Differential privacy with compression

ISIT'09 Proceedings of the 2009 IEEE international conference on Symposium on Information Theory - Volume 4
The hardness and approximation algorithms for l-diversity

Proceedings of the 13th International Conference on Extending Database Technology
Beyond k-Anonymity: A Decision Theoretic Framework for Assessing Privacy Risk

Transactions on Data Privacy
Privacy-preserving data publishing: A survey of recent developments

ACM Computing Surveys (CSUR)
Differential privacy for collaborative security

Proceedings of the Third European Workshop on System Security
Private approximation of clustering and vertex cover

TCC'07 Proceedings of the 4th conference on Theory of cryptography
K-anonymization with minimal loss of information

ESA'07 Proceedings of the 15th annual European conference on Algorithms
Differential privacy: a survey of results

TAMC'08 Proceedings of the 5th international conference on Theory and applications of models of computation
An ad omnia approach to defining and achieving private data analysis

PinKDD'07 Proceedings of the 1st ACM SIGKDD international conference on Privacy, security, and trust in KDD
On the geometry of differential privacy

Proceedings of the forty-second ACM symposium on Theory of computing
Differential privacy under continual observation

Proceedings of the forty-second ACM symposium on Theory of computing
The price of privately releasing contingency tables and the spectra of random matrices with correlated rows

Proceedings of the forty-second ACM symposium on Theory of computing
Towards an axiomatization of statistical privacy and utility

Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Differentially private aggregation of distributed time-series with transformation and encryption

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Preserving privacy and fairness in peer-to-peer data integration

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Privacy integrated queries: an extensible platform for privacy-preserving data analysis

Communications of the ACM
Data mining with differential privacy

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Airavat: security and privacy for MapReduce

NSDI'10 Proceedings of the 7th USENIX conference on Networked systems design and implementation
xBook: redesigning privacy control in social networking platforms

SSYM'09 Proceedings of the 18th conference on USENIX security symposium
Centralized and Distributed Anonymization for High-Dimensional Healthcare Data

ACM Transactions on Knowledge Discovery from Data (TKDD)
Distance makes the types grow stronger: a calculus for differential privacy

Proceedings of the 15th ACM SIGPLAN international conference on Functional programming
A firm foundation for private data analysis

Communications of the ACM
Epistemic privacy

Journal of the ACM (JACM)
Differential privacy in new settings

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
P4P: practical large-scale privacy-preserving distributed computation robust against malicious users

USENIX Security'10 Proceedings of the 19th USENIX conference on Security
Social Network Analysis and Mining for Business Applications

ACM Transactions on Intelligent Systems and Technology (TIST)
PCPs and the hardness of generating private synthetic data

TCC'11 Proceedings of the 8th conference on Theory of cryptography
Limits of computational differential privacy in the client/server setting

TCC'11 Proceedings of the 8th conference on Theory of cryptography
No free lunch in data privacy

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Selling privacy at auction

Proceedings of the 12th ACM conference on Electronic commerce
Privately releasing conjunctions and the statistical query barrier

Proceedings of the forty-third annual ACM symposium on Theory of computing
Privacy-preserving statistical estimation with optimal convergence rates

Proceedings of the forty-third annual ACM symposium on Theory of computing
Preserving privacy in data mining via importance weighting

PSDML'10 Proceedings of the international ECML/PKDD conference on Privacy and security issues in data mining and machine learning
Large margin multiclass gaussian classification with differential privacy

PSDML'10 Proceedings of the international ECML/PKDD conference on Privacy and security issues in data mining and machine learning
Differential privacy under fire

SEC'11 Proceedings of the 20th USENIX conference on Security
Anonymity meets game theory: secure data integration with malicious participants

The VLDB Journal — The International Journal on Very Large Data Bases
To release or not to release: evaluating information leaks in aggregate human-genome data

ESORICS'11 Proceedings of the 16th European conference on Research in computer security
Wherefore art thou R3579X?: anonymized social networks, hidden patterns, and structural steganography

Communications of the ACM
Adversarial machine learning

Proceedings of the 4th ACM workshop on Security and artificial intelligence
How much is enough? choosing ε for differential privacy

ISC'11 Proceedings of the 14th international conference on Information security
Sharing graphs using differentially private graph models

Proceedings of the 2011 ACM SIGCOMM conference on Internet measurement conference
What Can We Learn Privately?

SIAM Journal on Computing
Ask a better question, get a better answer a new approach to private data analysis

ICDT'07 Proceedings of the 11th international conference on Database Theory
Private data release via learning thresholds

Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
Submodular functions are noise stable

Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms
Differential privacy

ICALP'06 Proceedings of the 33rd international conference on Automata, Languages and Programming - Volume Part II
Bounds on the sample complexity for private learning and private data release

TCC'10 Proceedings of the 7th international conference on Theory of Cryptography
When random sampling preserves privacy

CRYPTO'06 Proceedings of the 26th annual international conference on Advances in Cryptology
Our data, ourselves: privacy via distributed noise generation

EUROCRYPT'06 Proceedings of the 24th annual international conference on The Theory and Applications of Cryptographic Techniques
Beyond k-anonymity: a decision theoretic framework for assessing privacy risk

PSD'06 Proceedings of the 2006 CENEX-SDC project international conference on Privacy in Statistical Databases
Calibrating noise to sensitivity in private data analysis

TCC'06 Proceedings of the Third conference on Theory of Cryptography
Utility-preserving transaction data anonymization with low information loss

Expert Systems with Applications: An International Journal
Noiseless database privacy

ASIACRYPT'11 Proceedings of the 17th international conference on The Theory and Application of Cryptology and Information Security
k-Concealment: An Alternative Model of k-Type Anonymity

Transactions on Data Privacy
A Practical Differentially Private Random Decision Tree Classifier

Transactions on Data Privacy
Beating randomized response on incoherent matrices

STOC '12 Proceedings of the forty-fourth annual ACM symposium on Theory of computing
Towards statistical queries over distributed private user data

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Iterative constructions and private data release

TCC'12 Proceedings of the 9th international conference on Theory of Cryptography
A complete characterization of statistical query learning with applications to evolvability

Journal of Computer and System Sciences
PrivBasis: frequent itemset mining with differential privacy

Proceedings of the VLDB Endowment
Adaptive differentially private histogram of low-dimensional data

PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Faster algorithms for privately releasing marginals

ICALP'12 Proceedings of the 39th international colloquium conference on Automata, Languages, and Programming - Volume Part I
Equally contributory privacy-preserving k-means clustering over vertically partitioned data

Information Systems
A clustering approach for structural k-anonymity in social networks using genetic algorithm

Proceedings of the CUBE International Information Technology Conference
Characterizing the sample complexity of private learners

Proceedings of the 4th conference on Innovations in Theoretical Computer Science
Linear dependent types for differential privacy

POPL '13 Proceedings of the 40th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Differential privacy data release through adding noise on average value

NSS'12 Proceedings of the 6th international conference on Network and System Security
Differential private trajectory protection of moving objects

Proceedings of the Third ACM SIGSPATIAL International Workshop on GeoStreaming
A learning theory approach to noninteractive database privacy

Journal of the ACM (JACM)
Analyzing graphs with node differential privacy

TCC'13 Proceedings of the 10th theory of cryptography conference on Theory of Cryptography
A privacy framework: indistinguishable privacy

Proceedings of the Joint EDBT/ICDT 2013 Workshops
Anonymizing sequential releases under arbitrary updates

Proceedings of the Joint EDBT/ICDT 2013 Workshops
Privacy-preserving release of aggregate dynamic models

Proceedings of the 2nd ACM international conference on High confidence networked systems
Information preservation in statistical privacy and bayesian estimation of unattributed histograms

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Truthful mechanisms for agents that value privacy

Proceedings of the fourteenth ACM conference on Electronic commerce
Beyond worst-case analysis in private singular vector computation

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
Differential privacy for the analyst via private equilibrium computation

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
The geometry of differential privacy: the sparse and approximate cases

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
Statistical algorithms and a lower bound for detecting planted cliques

Proceedings of the forty-fifth annual ACM symposium on Theory of computing
Membership privacy: a unifying framework for privacy definitions

Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security
Faster private release of marginals on small databases

Proceedings of the 5th conference on Innovations in theoretical computer science
Understanding hierarchical methods for differentially private histograms

Proceedings of the VLDB Endowment
A near-optimal algorithm for differentially-private principal components

The Journal of Machine Learning Research
Bounds on the sample complexity for private learning and private data release

Machine Learning
A generic and distributed privacy preserving classification method with a worst-case privacy guarantee

Distributed and Parallel Databases

Quantified Score

Hi-index	0.05

Visualization

Abstract

We consider a statistical database in which a trusted administrator introduces noise to the query responses with the goal of maintaining privacy of individual database entries. In such a database, a query consists of a pair (S, f) where S is a set of rows in the database and f is a function mapping database rows to {0, 1}. The true answer is ΣiεS f(di), and a noisy version is released as the response to the query. Results of Dinur, Dwork, and Nissim show that a strong form of privacy can be maintained using a surprisingly small amount of noise -- much less than the sampling error -- provided the total number of queries is sublinear in the number of database rows. We call this query and (slightly) noisy reply the SuLQ (Sub-Linear Queries) primitive. The assumption of sublinearity becomes reasonable as databases grow increasingly large.We extend this work in two ways. First, we modify the privacy analysis to real-valued functions f and arbitrary row types, as a consequence greatly improving the bounds on noise required for privacy. Second, we examine the computational power of the SuLQ primitive. We show that it is very powerful indeed, in that slightly noisy versions of the following computations can be carried out with very few invocations of the primitive: principal component analysis, k means clustering, the Perceptron Algorithm, the ID3 algorithm, and (apparently!) all algorithms that operate in the in the statistical query learning model [11].