IAMPE: NMR-Assisted Computational Prediction of Antimicrobial Peptides.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Antimicrobial peptides (AMPs) are at the focus of attention due to their therapeutic importance and developing computational tools for the identification of efficient antibiotics from the primary structure. Here, we utilized the CNMR spectral of amino acids and clustered them into various groups. These clusters were used to build feature vectors for the AMP sequences based on the composition, transition, and distribution of cluster members. These features, along with the physicochemical properties of AMPs were exploited to learn computational models to predict active AMPs solely from their sequences. Naïve Bayes (NB), k-nearest neighbors (KNN), support-vector machine (SVM), random forest (RF), and eXtreme Gradient Boosting (XGBoost) were employed to build the classification system using the collected AMP datasets from the CAMP, LAMP, ADAM, and AntiBP databases. Our results were validated and compared with the CAMP and ADAM prediction systems and indicated that the synergistic combination of the CNMR features with the physicochemical descriptors enables the proposed ensemble mechanism to improve the prediction performance of active AMP sequences. Our web-based AMP prediction platform, IAMPE, is available at http://cbb1.ut.ac.ir/.

Authors

  • Kaveh Kavousi
    Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
  • Mojtaba Bagheri
    Peptide Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran.
  • Saman Behrouzi
    Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran.
  • Safar Vafadar
    Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran.
  • Fereshteh Fallah Atanaki
    Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran.
  • Bahareh Teimouri Lotfabadi
    Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran.
  • Shohreh Ariaeenejad
    Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREO), Karaj, Iran.
  • Abbas Shockravi
    Faculty of Chemistry, Kharazmi University, Tehran 14911-15719, Iran.
  • Ali Akbar Moosavi-Movahedi
    Department of Biophysics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran. moosavi@ut.ac.ir.