Pathway-based microarray analysis for robust disease classification
Neural Computing and Applications - Special Issue on ICONIP2010
Hi-index | 0.00 |
The advances in high throughput microarray technology have enabled genome-wide expression analysis to identify diagnostic biomarkers of various disease states. In this work, muti-class classification of lung cancer data is developed based on our previous accurate and robust binary-class classification using pathway activity data. In particular, the pathway activity of each pathway was inferred using a Negatively Correlated Feature Set (NCFS) method based on curated pathway data from MSigDB, which combines pathway data of many public databases such as KEGG, PubMed, BioCarta, etc. The developed technique was tested on three independent datasets as well as a merged dataset. The results show that using a two-stage binary classification process on independent datasets provided the best performance. Nonetheless, the multi-class SVM technique also yielded acceptable results.