Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry

  • Authors:
  • Yan Fu;Qiang Yang;Ruixiang Sun;Dequan Li;Rong Zeng;Charles X. Ling;Wen Gao

  • Affiliations:
  • Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China,;Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong,;Graduate School of Chinese Academy of Sciences, Beijing 100039, China,;Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China,;Research Center for Proteome Analysis, Key Lab of Proteomics, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, ...;Department of Computer Science, The University of Western Ontario, London, Ontario, Canada N6A 5B7;Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100080, China,

  • Venue:
  • Bioinformatics
  • Year:
  • 2004

Quantified Score

Hi-index 3.85

Visualization

Abstract

Motivation: The correlation among fragment ions in a tandem mass spectrum is crucial in reducing stochastic mismatches for peptide identification by database searching. Until now, an efficient scoring algorithm that considers the correlative information in a tunable and comprehensive manner has been lacking. Results: This paper provides a promising approach to utilizing the correlative information for improving the peptide identification accuracy. The kernel trick, rooted in the statistical learning theory, is exploited to address this issue with low computational effort. The common scoring method, the tandem mass spectral dot product (SDP), is extended to the kernel SDP (KSDP). Experiments on a dataset reported previously demonstrate the effectiveness of the KSDP. The implementation on consecutive fragments shows a decrease of 10% in the error rate compared with the SDP. Our software tool, pFind, using a simple scoring function based on the KSDP, outperforms two SDP-based software tools, SEQUEST and Sonar MS/MS, in terms of identification accuracy. Supplementary Information: http://www.jdl.ac.cn/user/yfu/pfind/index.html