Personal health information leak prevention in heterogeneous texts

  • Authors:
  • Marina Sokolova;Khaled El Emam;Sean Rose;Sadrul Chowdhury;Emilio Neri;Elizabeth Jonker;Liam Peyton

  • Affiliations:
  • Children's Hospital of Eastern Ontario, Ottawa, Canada;Children's Hospital of Eastern Ontario, Ottawa, Canada and University of Ottawa, Ottawa, Canada, ON;University of Ottawa, Ottawa, Canada, ON;Children's Hospital of Eastern Ontario, Ottawa, Canada;Children's Hospital of Eastern Ontario, Ottawa, Canada;Children's Hospital of Eastern Ontario, Ottawa, Canada;University of Ottawa, Ottawa, Canada, ON

  • Venue:
  • AdaptLRTtoND '09 Proceedings of the Workshop on Adaptation of Language Resources and Technology to New Domains
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

We built a system which prevents leaks of personal health information inadvertently disclosed in heterogeneous text data. The system works with free-form texts. We empirically tested the system on files gathered from peer-to-peer file exchange networks. This study presents our text analysis apparatus. We discuss adaptation of lexical sources used in medical, scientific, domain for analysis of personal health information.