VirusImmu: a novel ensemble machine learning approach for viral immunogenicity prediction.

Journal: Briefings in functional genomics
PMID:

Abstract

The viruses threats provoke concerns regarding their sustained epidemic transmission, making the development of vaccines particularly important. In the prolonged and costly process of vaccine development, the most important initial step is to identify protective immunogens. Machine learning (ML) approaches are productive in analyzing big data such as microbial proteomes, and can remarkably reduce the cost of experimental work in developing novel vaccine candidates. We intensively evaluated the B cell epitope immunogenicity prediction power of eight commonly-used ML methods by random sampling cross validation on a large dataset consisting of known viral immunogens and non-immunogens we manually curated from the public domain. Extreme Gradient Boosting, K Nearest Neighbours, and Random Forest) showed the strongest predictive power. We then proposed a novel soft-voting based ensemble approach (VirusImmu), which demonstrated a powerful and stable capability for viral immunogenicity prediction across the test set and external test set irrespective of protein sequence length. VirusImmu was successfully applied to facilitate identifying linear B cell epitopes against African Swine Fever Virus as confirmed by indirect ELISA in vitro. In short, VirusImmu exhibited tremendous potentials in predicting immunogenicity of viral protein segments. It is freely accessible at https://github.com/zhangjbig/VirusImmu.

Authors

  • Jing Li
    Department of Neurosurgery, Tianjin Medical University General Hospital, Tianjin, China.
  • Zhongpeng Zhao
    Department of Immunology, School of Basic Medical Sciences, Anhui Medical University, No 81 Meishan Road, Shushan District, Hefei 230032, China.
  • ChengZheng Tai
    Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, Beijing Advanced Innovation Centre for Biomedical Engineering, School of Engineering Medicine, School of Biological Science and Medical Engineering, Beihang University, 37 Xueyuan Road, Haidian Distirct, Beijing 100083, P. R. China.
  • Ting Sun
    Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, Beijing Advanced Innovation Centre for Biomedical Engineering, School of Engineering Medicine, School of Biological Science and Medical Engineering, Beihang University, No.37 Xueyuan Road, Haidian District, Beijing, 100083, People's Republic of China.
  • Lingyun Tan
    Department of Immunology, School of Basic Medical Sciences, Anhui Medical University, No 81 Meishan Road, Shushan District, Hefei 230032, China.
  • Xinyu Li
    School of Pharmacy, Binzhou Medical University, Yantai, China.
  • Wei He
    Department of Orthopaedics Surgery, First Affiliated Hospital of Guangzhou University of Traditional Chinese Medicine, Guangzhou, China.
  • Hongjun Li
    School of Agricultural Engineering and Food Science, Shandong University of Technology, Zhangdian District, No. 12, Zhangzhou Road, Zibo, Shandong Province, China.
  • Jing Zhang
    MOEMIL Laboratory, School of Optoelectronic Information, University of Electronic Science and Technology of China, Chengdu, China.