Deep Learning Powers Protein Identification from Precursor MS Information.

Journal: Journal of proteome research
PMID:

Abstract

Proteome analysis currently heavily relies on tandem mass spectrometry (MS/MS), which does not fully utilize MS1 features, as many precursors remain unselected for MS/MS fragmentation, especially in the cases of low abundance samples and wide abundance dynamic range samples. Therefore, leveraging MS1 features as a complement to MS/MS has become an attractive option to improve the coverage of feature identification. Herein, we propose MonoMS1, an approach combining deep learning-based retention time, ion mobility, detectability prediction, and logistic regression-based scoring for MS1 feature identification. The approach achieved a significant increase in MS1 feature identification based on an data set. Application of MonoMS1 to data sets with wide dynamic range, such as human serum proteome samples, and with low sample abundance, such as single-cell proteome samples, enabled substantial complementation of MS/MS-based peptide and protein identification. This method opens a new avenue for proteomic analysis and can boost proteomic research on complex samples.

Authors

  • Yameng Dai
    Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China.
  • Yi Yang
    Department of Orthopedics, Orthopedic Research Institute, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
  • Enhui Wu
    Department of Chemistry, and Shanghai Stomatological Hospital, Fudan University, Shanghai 200000, China.
  • Chengpin Shen
    Shanghai Omicsolution Co., Ltd., Shanghai, 200000, China.
  • Liang Qiao
    Department of Chemistry, Shanghai Stomatological Hospital, and Institutes of Biomedical Sciences, Fudan University, Shanghai, 200000, China. liang_qiao@fudan.edu.cn.