Using sample size to limit exposure to data mining

  • Authors:
  • Chris Clifton

  • Venue:
  • Journal of Computer Security - Special issue on database security
  • Year:
  • 2000

Abstract

Data mining introduces new problems in database security. The basic problem of using non-sensitive data to infer sensitive data is made more difficult by the "probabilistic" inferences possible with data mining. This paper shows how lower bounds from pattern recognition theory can be used to determine sample sizes where data mining tools cannot obtain reliable results.