Pathway-based microarray analysis for robust disease classification

  • Authors:
  • Pitak Sootanan;Santitham Prom-on;Asawin Meechai;Jonathan H. Chan

  • Affiliations:
  • King Mongkut’s University of Technology Thonburi, Individual Based Program (Bioinformatics), Bangkok, Thailand;King Mongkut’s University of Technology Thonburi, Department of Computer Engineering, Bangkok, Thailand;King Mongkut’s University of Technology Thonburi, Department of Chemical Engineering, Bangkok, Thailand;King Mongkut’s University of Technology Thonburi, School of Information Technology, Bangkok, Thailand

  • Venue:
  • Neural Computing and Applications - Special Issue on ICONIP2010
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The advent of high-throughput technology has made it possible to measure genome-wide expression profiles, thus providing a new basis for microarray-based diagnosis of disease states. Numerous methods have been proposed to identify biomarkers that can accurately discriminate between case and control classes. Many of the methods used only a subset of ranked genes in the pathway and may not be able to fully represent the classification boundaries for the two disease classes. The use of negatively correlated feature sets (NCFS) to obtain more relevant features in form of phenotype-correlated genes (PCOGs) and inferring pathway activities is proposed in this study. The two pathway activity inference schemes that use NCFS significantly improved the power of pathway markers to discriminate between two phenotypes classes in microarray expression datasets of breast cancer. In particular, the NCFS-i method provided better contrasting features for classification purposes. The improvement is consistent for all cases of pathways used, using both within- and across-dataset validations. The results show that the two proposed methods that use NCFS clearly outperformed other pathway-based classifiers in terms of both ROC area and discriminative score. That is, the identification of PCOGs within each pathway, especially NCFS-i method, helps to reduce noisy or variable measurements, leading to a high performance and more robust classifier. In summary, we have demonstrated that effective incorporation of pathway information into expression-based disease diagnosis and using NCFS can provide better discriminative and more robust models.