BagReg: Protein inference through machine learning.

Journal: Computational biology and chemistry
Published Date:

Abstract

Protein inference from the identified peptides is of primary importance in the shotgun proteomics. The target of protein inference is to identify whether each candidate protein is truly present in the sample. To date, many computational methods have been proposed to solve this problem. However, there is still no method that can fully utilize the information hidden in the input data. In this article, we propose a learning-based method named BagReg for protein inference. The method firstly artificially extracts five features from the input data, and then chooses each feature as the class feature to separately build models to predict the presence probabilities of proteins. Finally, the weak results from five prediction models are aggregated to obtain the final result. We test our method on six public available data sets. The experimental results show that our method is superior to the state-of-the-art protein inference algorithms.

Authors

  • Can Zhao
    Ethnic Medical School, Chengdu University of Traditional Chinese Medicine, Chengdu 611131, China.
  • Dao Liu
    Baidu.com, Inc., No. 10, Shangdi 10th Street, Haidian District, Beijing, China.
  • Ben Teng
    School of Software, Dalian University of Technology, Dalian, China.
  • Zengyou He
    School of Software, Dalian University of Technology, Dalian, China. Electronic address: zyhe@dlut.edu.cn.