Epistemic privacy

Authors:
Alexandre Evfimievski;Ronald Fagin;David Woodruff
Affiliations:
IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA
Venue:
Journal of the ACM (JACM)
Year:
2010

Abstract

We present a novel definition of privacy in the framework of offline (retroactive) database query auditing. Given information about the database, a description of sensitive data, and assumptions about users' prior knowledge, our goal is to determine whether answering a past user's query could have led to a privacy breach. According to our definition, an audited property A is private, given the disclosure of property B, if no user can gain confidence in A by learning B, subject to prior knowledge constraints. Privacy is not violated if the disclosure of B causes a loss of confidence in A. The new notion of privacy is formalized using the well-known semantics for reasoning about knowledge, where logical properties correspond to sets of possible worlds (databases) that satisfy these properties. Database users are modeled either as possibilistic agents, whose knowledge is a set of possible worlds, or as probabilistic agents, whose knowledge is a probability distribution on possible worlds. We analyze the new privacy notion, show its relationship with the conventional approach, and derive criteria that allow the auditor to test privacy efficiently in some important cases. In particular, we prove characterization theorems for the possibilistic case, and study in depth the probabilistic case under the assumption that the user considers all database records a priori independent, as well as under more relaxed (or absent) prior-knowledge assumptions. In the probabilistic case we show that, for certain families of distributions, there is no efficient algorithm to test whether an audited property A is private given the disclosure of a property B, assuming P ≠ NP. Nevertheless, for many interesting families, such as the family of product distributions, we obtain algorithms that are efficient both in theory and in practice.
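
As a rough illustration of the probabilistic notion described in the abstract, the sketch below checks by brute force whether learning B raises confidence in A, that is, whether P[A | B] > P[A], for users whose priors are product distributions over database records. It is not the paper's algorithm: the function names, the encoding of a database as a tuple of booleans (one per possible record), and the restriction to a finite, explicitly listed set of priors are all assumptions made for this example. The enumeration is exponential in the number of records, so it only suits toy inputs.

```python
from fractions import Fraction
from itertools import product


def world_probability(world, record_probs):
    """Probability of one world under a product prior (records independent)."""
    p = Fraction(1)
    for present, q in zip(world, record_probs):
        p *= q if present else 1 - q
    return p


def gains_confidence(prop_a, prop_b, record_probs):
    """True if learning B strictly raises this prior's confidence in A."""
    # Hypothetical encoding: a world is a tuple of booleans, one per record.
    worlds = list(product([False, True], repeat=len(record_probs)))
    p_a = sum(world_probability(w, record_probs) for w in worlds if prop_a(w))
    p_b = sum(world_probability(w, record_probs) for w in worlds if prop_b(w))
    p_ab = sum(
        world_probability(w, record_probs)
        for w in worlds
        if prop_a(w) and prop_b(w)
    )
    if p_b == 0:
        return False  # B cannot be observed under this prior
    # P[A | B] > P[A] is equivalent to P[A and B] > P[A] * P[B] when P[B] > 0.
    return p_ab > p_a * p_b


def is_private(prop_a, prop_b, priors):
    """A is private given B if no listed prior gains confidence in A.

    Assumption: `priors` is a finite list standing in for the family of
    admissible user priors; the paper's families are generally infinite.
    """
    return not any(gains_confidence(prop_a, prop_b, q) for q in priors)


# Two possible records; A says "record 0 is in the database".
priors = [
    (Fraction(1, 2), Fraction(1, 2)),
    (Fraction(1, 3), Fraction(3, 4)),
]
a = lambda w: w[0]
b_either = lambda w: w[0] or w[1]           # at least one record present
b_other = lambda w: w[1]                    # only mentions record 1
b_not_both = lambda w: not (w[0] and w[1])  # lowers confidence in A

private_either = is_private(a, b_either, priors)
private_other = is_private(a, b_other, priors)
private_not_both = is_private(a, b_not_both, priors)
```

In this toy setting, disclosing "at least one record is present" is not private with respect to A, because it raises confidence in A. Disclosing a property of an independent record leaves confidence unchanged, and disclosing "not both records are present" lowers it. Neither of the latter two counts as a violation under the definition in the abstract.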