A hybrid machine-learning approach for segmentation of protein localization data

  • Authors:
  • Peter M. Kasson;Johannes B. Huppa;Mark M. Davis;Axel T. Brunger

  • Affiliations:
  • Biophysics Program, Stanford Synchrotron Radiation Laboratory, Stanford University Stanford, CA 94305, USA;Howard Hughes Medical Institute Stanford, CA 94305, USA;Howard Hughes Medical Institute Stanford, CA 94305, USA;Department of Molecular and Cellular Physiology, Stanford Synchrotron Radiation Laboratory, Stanford University Stanford, CA 94305, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Subcellular protein localization data are critical to the quantitative understanding of cellular function and regulation. Such data are acquired via observation and quantitative analysis of fluorescently labeled proteins in living cells. Differentiation of labeled protein from cellular artifacts remains an obstacle to accurate quantification. We have developed a novel hybrid machine-learning-based method to differentiate signal from artifact in membrane protein localization data by deriving positional information via surface fitting and combining this with fluorescence-intensity-based data to generate input for a support vector machine. Results: We have employed this classifier to analyze signaling protein localization in T-cell activation. Our classifier displayed increased performance over previously available techniques, exhibiting both flexibility and adaptability: training on heterogeneous data yielded a general classifier with good overall performance; training on more specific data cyielded an extremely high-performance specific classifier. We also demonstrate accurate automated learning utilizing additional experimental data. Availability: http://atb.slac.stanford.edu/~kasson/membraneclassifier.html Contact: brunger@stanford.edu Supplementary information: http://atb.slac.stanford.edu/~kasson/classifier_suppl.pdf