Temporal pattern discovery in longitudinal electronic patient records

  • Authors:
  • G. Niklas Norén;Johan Hopstadius;Andrew Bate;Kristina Star;I. Ralph Edwards

  • Affiliations:
  • WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden and Department of Mathematics, Stockholm University, Stockholm, Sweden;WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden;WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden and School of Information Systems, Computing & Mathematics, Brunel University, London, UK;WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden;WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden

  • Venue:
  • Data Mining and Knowledge Discovery
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Large collections of electronic patient records provide a vast but still underutilised source of information on the real world use of medicines. They are maintained primarily for the purpose of patient administration, but contain a broad range of clinical information highly relevant for data analysis. While they are a standard resource for epidemiological confirmatory studies, their use in the context of exploratory data analysis is still limited. In this paper, we present a framework for open-ended pattern discovery in large patient records repositories. At the core is a graphical statistical approach to summarising and visualising the temporal association between the prescription of a drug and the occurrence of a medical event. The graphical overview contrasts the observed and expected number of occurrences of the medical event in different time periods both before and after the prescription of interest. In order to effectively screen for important temporal relationships, we introduce a new measure of temporal association, which contrasts the observed-to-expected ratio in a time period immediately after the prescription to the observed-to-expected ratio in a control period 2 years earlier. An important feature of both the observed-to-expected graph and the measure of temporal association is a statistical shrinkage towards the null hypothesis of no association, which provides protection against highlighting spurious associations. We demonstrate the usefulness of the proposed pattern discovery methodology by a set of examples from a collection of over two million patient records in the United Kingdom. The identified patterns include temporal relationships between drug prescriptions and medical events suggestive of persistent and transient risks of adverse events, possible beneficial effects of drugs, periodic co-occurrence, and systematic tendencies of patients to switch from one medication to another.