A practical approach to feature selection
ML92 Proceedings of the ninth international workshop on Machine learning
Fast training of support vector machines using sequential minimal optimization
Advances in kernel methods
Chi2: Feature Selection and Discretization of Numeric Attributes
TAI '95 Proceedings of the Seventh International Conference on Tools with Artificial Intelligence
Hi-index | 0.00 |
There has been much active research in bioinformatics to support our understanding of oncogenesis and tumor progression. Most research relies on mRNA gene expression data to identify marker genes or cancer specific gene networks. However, considering that proteins are functional molecules that carry out the biological tasks of genes, they can be direct markers of biological functions. Protein abundance data on a genome scale have not been investigated in depth due to the limited availability of high throughput protein assays. This hindrance is chiefly caused by a lack of robust techniques such as RT-PCR (real-time polymerase chain reaction). In this study, we quantified phospho-proteomes of breast cancer cell lines treated with TGF-beta (transforming growth factor beta). To discover biomarkers and observe changes in the signaling pathways related to breast cancer, we applied a protein network-based approach to generate a classifier of subnet markers. The accuracy of that classifier outperformed other network-based classification algorithms, and current feature selection and classification algorithms. Moreover, many cancer-related proteins were identified in those sub-networks. Each sub-network provides functional insights and can serve as a potential marker for TGF-beta treatments. After interpreting the roles of proteins in sub-networks with various signaling pathways, we found strong candidate proteins and various related interactions that are expected to affect breast cancer outcomes. These results demonstrate the high quality of the quantified phospho-proteomes data and show that our network construction and classification method is appropriate for an analysis of this type of data.