Insight into the Relationships Between Chemical, Protein and Functional Variables in the PBP/GOBP Family in Moths Based on Machine Learning.

Journal: International journal of molecular sciences
PMID:

Abstract

During their lives, insects must cope with a plethora of chemicals, of which a few will have an impact at the behavioral level. To detect these chemicals, insects use several protein families located in their main olfactory organs, the antennae. Inside the antennae, odorant-binding proteins (OBPs), as the most studied protein family, bind volatile chemicals to transport them. Pheromone-binding proteins (PBPs) and general-odorant-binding proteins (GOPBs) are two subclasses of OBPs and have evolved in moths with a putative olfactory role. Predictions for OBP-chemical interactions have remained limited, and functional data collected over the years unused. In this study, chemical, protein and functional data were curated, and related datasets were created with descriptors. Regression algorithms were implemented and their performance evaluated. Our results indicate that XGBoostRegressor exhibits the best performance ( of 0.76, RMSE of 0.28 and MAE of 0.20), followed by GradientBoostingRegressor and LightGBMRegressor. To the best of our knowledge, this is the first study showing a correlation among chemical, protein and functional data, particularly in the context of the PBP/GOBP family of proteins in moths.

Authors

  • Xaviera A López-Cortés
    Department of Computer Sciences and Industries, Universidad Católica del Maule, Talca, 3480112, Chile; Centro de Innovación en Ingeniería Aplicada (CIIA), Universidad Católica del Maule, Talca, 3480112, Chile. Electronic address: xlopez@ucm.cl.
  • Gabriel Lara
    Centro de Innovación en Ingeniería Aplicada (CIIA), Universidad Católica del Maule, Talca 3466706, Chile.
  • Nicolás Fernández
    Division of Urology, Hospital Universitario San Ignacio, Pontificia Universidad Javeriana, School of Medicine, Bogotá D.C., Colombia.
  • José M Manríquez-Troncoso
    Department of Computer Sciences and Industries, Universidad Católica del Maule, Talca, 3480112, Chile.
  • Herbert Venthur
    Laboratorio de Química Ecológica, Departamento de Ciencias Químicas y Recursos Naturales, Facultad de Ingenieria y Ciencias, Universidad de La Frontera, Temuco 4811230, Chile.