Protecting Sensitive Topics in Text Documents with PROTEXTOR

  • Authors:
  • Chad Cumby

  • Affiliations:
  • Accenture Technology Labs, Chicago, USA 60601

  • Venue:
  • ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This is a demonstration of a system for protecting sensitive topics present in text documents. Our system works in a privacy framework where the topic is characterized as a multiclass classification problem in a generative setting. We show how our system helps a user redact a document in a business setting to obscure what company the text pertains to, and show some experimental results on redacting the topic for a standard text classification data set.