Multilevel Bayesian networks for the analysis of hierarchical health care data

  • Authors:
  • Martijn Lappenschaar;Arjen Hommersom;Peter J. F. Lucas;Joep Lagro;Stefan Visscher

  • Affiliations:
  • Radboud University Nijmegen, Institute for Computing and Information Sciences, PO Box 9010, 6500 GL Nijmegen, The Netherlands;Radboud University Nijmegen, Institute for Computing and Information Sciences, PO Box 9010, 6500 GL Nijmegen, The Netherlands;Radboud University Nijmegen, Institute for Computing and Information Sciences, PO Box 9010, 6500 GL Nijmegen, The Netherlands;Radboud University Nijmegen Medical Centre, Department of Geriatric Medicine, PO Box 9101, 6500 HB Nijmegen, The Netherlands;Netherlands Institute for Health Services Research (NIVEL), PO Box 1568, 3500 BN Utrecht, The Netherlands

  • Venue:
  • Artificial Intelligence in Medicine
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Objective: Large health care datasets normally have a hierarchical structure, in terms of levels, as the data have been obtained from different practices, hospitals, or regions. Multilevel regression is the technique commonly used to deal with such multilevel data. However, for the statistical analysis of interactions between entities from a domain, multilevel regression yields little to no insight. While Bayesian networks have proved to be useful for analysis of interactions, they do not have the capability to deal with hierarchical data. In this paper, we describe a new formalism, which we call multilevel Bayesian networks; its effectiveness for the analysis of hierarchically structured health care data is studied from the perspective of multimorbidity. Methods: Multilevel Bayesian networks are formally defined and applied to analyze clinical data from family practices in The Netherlands with the aim to predict interactions between heart failure and diabetes mellitus. We compare the results obtained with multilevel regression. Results: The results obtained by multilevel Bayesian networks closely resembled those obtained by multilevel regression. For both diseases, the area under the curve of the prediction model improved, and the net reclassification improvements were significantly positive. In addition, the models offered considerable more insight, through its internal structure, into the interactions between the diseases. Conclusions: Multilevel Bayesian networks offer a suitable alternative to multilevel regression when analyzing hierarchical health care data. They provide more insight into the interactions between multiple diseases. Moreover, a multilevel Bayesian network model can be used for the prediction of the occurrence of multiple diseases, even when some of the predictors are unknown, which is typically the case in medicine.