Hybrid rough sets intelligent system architecture for survival analysis

  • Authors:
  • Puntip Pattaraintakorn;Nick Cercone;Kanlaya Naruedomkul

  • Affiliations:
  • Department of Mathematics and Computer Science, Faculty of Science, King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand;Faculty of Science and Engineering, York University, Toronto, Ontario, Canada;Department of Mathematics, Faculty of Science, Mahidol University, Thailand

  • Venue:
  • Transactions on rough sets VII
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Survival analysis challenges researchers because of two issues. First, in practice, the studies do not span wide enough to collect all survival times of each individual patient. All of these patients require censor variables and cannot be analyzed without special treatment. Second, analyzing risk factors to indicate the significance of the effect on survival time is necessary. Hence, we propose "Enhanced Hybrid Rough Sets Intelligent System Architecture for Survival Analysis" (Enhanced HYRIS) that can circumvent these two extra issues. Given the survival data set, Enhanced HYRIS can analyze and construct a life time table and Kaplan-Meier survival curves that account for censor variables. We employ three statistical hypothesis tests and use the p-value to identify the significance of a particular risk factor. Subsequently, rough set theory generates the probe reducts and reducts. Probe reducts and reducts include only a risk factor subset that is large enough to include all of the essential information and small enough for our survival prediction model to be created. Furthermore, in the rule induction stage we offer survival prediction models in the form of decision rules and association rules. In the validation stage, we provide cross validation with ELEM2 as well as decision tree. To demonstrate the utility of our methods, we apply Enhanced HYRIS to various data sets: geriatric, melanoma and primary biliary cirrhosis (PBC) data sets. Our experiments cover analyzing risk factors, performing hypothesis tests and we induce survival prediction models that can predict survival time efficiently and accurately.