Comparison of deep learning models with simple method to assess the problem of antimicrobial peptides prediction.

Journal: Molecular informatics
Published Date:

Abstract

Antibiotic-resistant strains are an emerging threat to public health. The usage of antimicrobial peptides (AMPs) is one of the promising approaches to solve this problem. For the development of new AMPs, it is necessary to have reliable prediction methods. Recently, deep learning approaches have been used to predict AMP. In this paper, we want to compare simple and complex methods for these purposes. We used the BERT transformer to create sequence embeddings and the multilayer perceptron (MLP) and light attention (LA) approaches for classification. One of them reached about 80 % accuracy and specificity in benchmark testing, which is on par with the best available methods. For comparison, we proposed a simple method using only the amino acid composition of proteins or peptides. This method has shown good results, at the level of the best methods. We have prepared a special server for predicting the ability of AMPs by amino acid composition: http://bioproteom.protres.ru/antimicrob/.

Authors

  • M Y Lobanov
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
  • M V Slizen
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
  • N V Dovidchenko
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
  • A V Panfilov
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
  • A A Surin
    Faculty of Applied math, MIREA - Russian Technological University, Moscow, 119454, Russia.
  • I V Likhachev
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
  • O V Galzitskaya
    Laboratory of Bioinformatics and Proteomics, Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.