Unsupervised pattern discovery in electronic health care data using probabilistic clustering models

  • Authors:
  • Benjamin M. Marlin;David C. Kale;Robinder G. Khemani;Randall C. Wetzel

  • Affiliations:
  • University of Massachusetts Amherst, Amherst, MA, USA;Children's Hospital Los Angeles, Los Angeles, CA, USA;Children's Hospital Los Angeles, Los Angeles, CA, USA;Children's Hospital Los Angeles, Los Angeles, CA, USA

  • Venue:
  • Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Bedside clinicians routinely identify temporal patterns in physiologic data in the process of choosing and administering treatments intended to alter the course of critical illness for individual patients. Our primary interest is the study of unsupervised learning techniques for automatically uncovering such patterns from the physiologic time series data contained in electronic health care records. This data is sparse, high-dimensional and often both uncertain and incomplete. In this paper, we develop and study a probabilistic clustering model designed to mitigate the effects of temporal sparsity inherent in electronic health care records data. We evaluate the model qualitatively by visualizing the learned cluster parameters and quantitatively in terms of its ability to predict mortality outcomes associated with patient episodes. Our results indicate that the model can discover distinct, recognizable physiologic patterns with prognostic significance.