Comparison of regression tree data mining methods for prediction of mortality in head injury

  • Authors:
  • Necdet Sut;Osman Simsek

  • Affiliations:
  • Department of Biostatistics and Medical Informatics, Trakya University Medical Faculty, Edirne, Turkey;Department of Neurosurgery, Trakya University Medical Faculty, Edirne, Turkey

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 12.05

Visualization

Abstract

With this research, we sought to examine the performance of six different regression tree data mining methods to predict mortality in head injury. Using a data set consisting of 1603 head injury cases, we assessed the performance of: the Classification and Regression Trees (CART) method; the Chi-squared Automatic Interaction Detector (CHAID) method; the Exhaustive CHAID (E-CHAID) method; the Quick, Unbiased, Efficient Statistical Tree (QUEST) method; the Random Forest Regression and Classification (RFRC) method; and the Boosted Tree Classifiers and Regression (BTCR) method, in each case based on sensitivity, specificity, positive/negative predictive, and accuracy rates. Next, we compared their areas under the (Receiver Operating Characteristic) ROC curves. Finally, we examined whether they could be grouped in meaningful clusters with hierarchical cluster analysis. Areas under the ROC curves of regression tree data mining methods ranged from 0.801 to 0.954 (p