Data Leak Prevention through Named Entity Recognition

  • Authors:
  • Jose Maria Gomez-Hidalgo;Jose Miguel Martin-Abreu;Javier Nieves;Igor Santos;Felix Brezo;Pablo G. Bringas

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • SOCIALCOM '10 Proceedings of the 2010 IEEE Second International Conference on Social Computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The rise of the social web has brought a series of privacy concerns and threats. In particular, data leakage is a risk that affects the privacy of not only companies but individuals. Although there are tools that can prevent data losses, they require a prior step that involves the sensitive data to be properly identified. In this paper, we propose a new automatic approach that applies Named Entity Recognition (NER) to prevent data leaks. We conduct an empirical study with real-world data and show that this NER-based approach can enhance the prevention of data losses. In addition, we present and detail the implementation of a prototype built with these techniques and show how it can be used by both particulars and companies in order to handle data losses.