Secure statistical databases with random sample queries

  • Authors:
  • Dorothy E. Denning

  • Affiliations:
  • Purdue Univ., West Lafayette, IN

  • Venue:
  • ACM Transactions on Database Systems (TODS)
  • Year:
  • 1980

Quantified Score

Hi-index 0.03

Visualization

Abstract

A new inference control, called random sample queries, is proposed for safeguarding confidential data in on-line statistical databases. The random sample queries control deals directly with the basic principle of compromise by making it impossible for a questioner to control precisely the formation of query sets. Queries for relative frequencies and averages are computed using random samples drawn from the query sets. The sampling strategy permits the release of accurate and timely statistics and can be implemented at very low cost. Analysis shows the relative error in the statistics decreases as the query set size increases; in contrast, the effort required to compromise increases with the query set size due to large absolute errors. Experiments performed on a simulated database support the analysis.