Heterologous Machine Learning for the Identification of Antimicrobial Activity in Human-Targeted Drugs.

Journal: Molecules (Basel, Switzerland)
Published Date:

Abstract

The emergence of microbes resistant to common antibiotics represent a current treat to human health. It has been recently recognized that non-antibiotic labeled drugs may promote antibiotic-resistance mechanisms in the human microbiome by presenting a secondary antibiotic activity; hence, the development of computer-assisted procedures to identify antibiotic activity in human-targeted compounds may assist in preventing the emergence of resistant microbes. In this regard, it is worth noting that while most antibiotics used to treat human infectious diseases are non-peptidic compounds, most known antimicrobials nowadays are peptides, therefore all computer-based models aimed to predict antimicrobials either use small datasets of non-peptidic compounds rendering predictions with poor reliability or they predict antimicrobial peptides that are not currently used in humans. Here we report a machine-learning-based approach trained to identify gut antimicrobial compounds; a unique aspect of our model is the use of heterologous training sets, in which peptide and non-peptide antimicrobial compounds were used to increase the size of the training data set. Our results show that combining peptide and non-peptide antimicrobial compounds rendered the best classification of gut antimicrobial compounds. Furthermore, this classification model was tested on the latest human-approved drugs expecting to identify antibiotics with broad-spectrum activity and our results show that the model rendered predictions consistent with current knowledge about broad-spectrum antibiotics. Therefore, heterologous machine learning rendered an efficient computational approach to classify antimicrobial compounds.

Authors

  • Rodrigo A Nava Lara
    Department of biochemistry and structural biology, Instituto de Fisiología Celular, UNAM, Mexico City 04510, Mexico. rnava@email.ifc.unam.mx.
  • Longendri Aguilera-Mendoza
    Computer Science Department, CICESE Research Center, Ensenada, Baja California 22860, Mexico. longendri@gmail.com.
  • Carlos A Brizuela
    Computer Science Department, CICESE Research Center, Ensenada, Baja California 22860, Mexico. cbrizuel@cicese.mx.
  • Antonio Peña
    Department of genetics, Instituto de Fisiología Celular, UNAM, Mexico City 04510, Mexico. apd@ifc.unam.mx.
  • Gabriel Del Rio
    Department of Biochemistry and Structural Biology, Instituto de Fisiologa Celular, Universidad Nacional Autónoma de México, México D. F., México. Electronic address: gdelrio@ifc.unam.mx.