A Semi-Supervised Learning Approach to Integrated Salient Risk Features for Bone Diseases

  • Authors:
  • Hui Li;Xiaoyi Li;Murali Ramanathan;Aidong Zhang

  • Affiliations:
  • Department of Computer Science and Engineering, State University of New York at Buffalo, USA;Department of Computer Science and Engineering, State University of New York at Buffalo, USA;Department of Pharmaceutical Sciences, State University of New York at Buffalo, USA;Department of Computer Science and Engineering, State University of New York at Buffalo, USA

  • Venue:
  • Proceedings of the International Conference on Bioinformatics, Computational Biology and Biomedical Informatics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The study of the risk factor analysis and prediction for diseases requires the understanding of the complicated and highly correlated relationships behind numerous potential risk factors (RFs). The existing models for this purpose usually fix a small number of RFs based on the expert knowledge. Although handcrafted RFs are usually statistically significant, those abandoned RFs might still contain valuable information for explaining the comprehensiveness of a disease. However, it is impossible to simply keep all of RFs. So how to find the integrated risk features from numerous potential RFs becomes a particular challenging task. Another major challenge for this task is the lack of sufficient labeled data and missing values in the training data. In this paper, we focus on the identification of the relationships between a bone disease and its potential risk factors by learning a deep graphical model in an epidemiologic study for the purpose of predicting osteoporosis and bone loss. An effective risk factor analysis approach which delineates both observed and hidden risk factors behind a disease encapsulates the salient features and also provides a framework for two prediction tasks. Specifically, we first investigate an approach to show the salience of the integrated risk features yielding more abstract and useful representations for the prediction. Then we formulate the whole prediction problem as two separate tasks to evaluate our new representation of integrated features. With the success of the osteoporosis prediction, we further take advantage of the Positive output and predict the progression trend of osteoporosis severity. We capture the characteristics of data itself and intrinsic relatedness between two relevant prediction tasks by constructing a deep belief network followed with a two-stage fine-tuning (FT). Moreover, our proposed method results in stable and promising results without using any prior information. The superior performance on our evaluation metrics confirms the effectiveness of the proposed approach for extraction of the integrated salient risk features for predicting bone diseases.