Software systems for tabular data releases

  • Authors:
  • Adrian Dobra;Alan F. Karr;Ashish P. Sanil;Stephen E. Fienberg

  • Affiliations:
  • National Institute of Statistical Sciences, PO Box 14006, Research Triangle Park, NC;National Institute of Statistical Sciences, PO Box 14006, Research Triangle Park, NC;National Institute of Statistical Sciences, PO Box 14006, Research Triangle Park, NC;Department of Statistics, Carnegie Mellon University, Pittsburgh, PA

  • Venue:
  • International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
  • Year:
  • 2002

Abstract

We describe two classes of software systems that release tabular summaries of an underlying database. Table servers respond to user queries for (marginal) sub-tables of the "full" table summarizing the entire database, and are characterized by dynamic assessment of disclosure risk in light of previously answered queries. Optimal tabular releases are static releases of sets of sub-tables, chosen to maximize the amount of information released, as given by a measure of data utility, subject to a constraint on disclosure risk. We discuss the underlying abstractions (primarily associated with the query space, as well as released and unreleasable sub-tables and frontiers), computational algorithms and issues (especially scalability), and prototype software implementations.
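
To make the notions of a marginal sub-table and a released frontier concrete, the following is a minimal Python sketch and not the authors' software. It assumes the database is a list of categorical records, that a query is a set of variable names, and that releasing a marginal implicitly releases every marginal over a subset of its variables (which follows by summation). The function names `marginal`, `released_frontier` and `is_derivable`, and the toy records, are hypothetical; disclosure risk and data utility are not modeled here.

```python
from collections import Counter

def marginal(records, variables):
    """Count records by their values on the chosen variables (a marginal sub-table)."""
    return Counter(tuple(rec[v] for v in variables) for rec in records)

def released_frontier(released):
    """Keep only the maximal released variable sets; their subsets are implied."""
    sets = {frozenset(s) for s in released}
    return {s for s in sets if not any(s < t for t in sets)}

def is_derivable(query, released):
    """A marginal is derivable if its variables lie inside some released set."""
    q = frozenset(query)
    return any(q <= frozenset(s) for s in released)

# Toy database (illustrative values only, not from the paper).
records = [
    {"sex": "F", "age": "young", "region": "N"},
    {"sex": "F", "age": "old", "region": "N"},
    {"sex": "M", "age": "young", "region": "S"},
    {"sex": "M", "age": "young", "region": "N"},
]

# Two-way marginal of the full sex x age x region table.
sex_age = marginal(records, ("sex", "age"))

# Queries answered so far, and the frontier that summarizes them.
released = [("sex", "age"), ("sex",), ("region",)]
frontier = released_frontier(released)
```

In this sketch a table server would consult `frontier` before answering a new query: `is_derivable(("age",), released)` is true because the age marginal can be summed out of the released sex-by-age table, whereas an age-by-region query is new information and would require a fresh risk assessment.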