Quantifying information leakage in document redaction

  • Authors:
  • Daniel Lopresti;A. Lawrence Spitz

  • Affiliations:
  • Lehigh University, Bethlehem, PA;DocRec Ltd., Atawhai, Nelson, New Zealand

  • Venue:
  • Proceedings of the 1st ACM workshop on Hardcopy document processing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we examine ways in which sensitive information might leak through the process of redaction. Such attacks apply known methods from document image analysis and natural language processing to recover text thought to have been obliterated for the purposes of public release. Systematically identifying and testing these weaknesses is a first step towards designing effective countermeasures. We describe our development of a prototype semi-automated system intended to accept as input a redacted document and provide feedback to the user as to whether the document might suffer from such leaks.