Discovering Compact and Highly Discriminative Features or Feature Combinations of Drug Activities Using Support Vector Machines

  • Authors:
  • Hwanjo Yu;Jiong Yang;Wei Wang;Jiawei Han

  • Affiliations:
  • -;-;-;-

  • Venue:
  • CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Nowadays, high throughput experimental techniques make it feasible to examine and collect massive dataat the molecular level.These data, typically mappedto a very high dimensional feature space, carry richinformation about functionalities of certain chemicalor biological entities and can be used to infer valuableknowledge for the purposes of classification and prediction.Typically, a small number of features or featurecombinations may play determinant roles in functionaldiscrimination.The identification of such features orfeature combinations is of great importance.In this paper,we study the problem of discovering compact andhighly discriminative features or feature combinationsfrom a rich feature collection.We employ the supportvector machine as the classification means and aim atfinding compact feature combinations.Comparing toprevious methods on feature selection, which identifyfeatures solely based on their individual roles in theclassification, our method is able to identify minimalfeature combinations that ultimately have determinantroles in a systematic fashion.Experimental study ondrug activity data shows that our method can discoverdescriptors that are not necessarily significant individuallybut are most significant collectively.