Leveraging viral genome sequences and machine learning models for identification of potentially selective antiviral agents.

Journal: Communications chemistry
Published Date:

Abstract

Viral genome sequencing provides valuable information for antiviral development, yet its integration with machine learning for virtual screening remains underexplored. To bridge this gap, viral genome sequences were combined with structural data of approved and investigational antivirals to identify virus-selective agents. In parallel, quantitative structure-activity relationship (QSAR) models were built to predict pan-antivirals. Robust models were generated with the area under the receiver operating characteristic curve (AUC-ROC) >0.72 for virus-selective and >0.79 for pan-antiviral predictions. These models were applied to virtually screen ~360 K compounds for anti-SARS-CoV-2 activity. The 346 compounds identified by the models were tested using two in vitro assays, yielding hit rates of 9.4% (24/256) in the pseudotyped particle (PP) entry assay and 37% (47/128) in the RNA-dependent RNA polymerase (RdRp) assay. The top compounds showed potencies around 1 µM. This study provides a framework for virtual screening of virus-selective and pan- antivirals against emerging pathogens.

Authors

  • Tuan Xu
    Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States.
  • Miao Xu
    Intensive Care Unit, Critical Care Medicine, Affiliated Hongqi Hospital of Mudanjiang Medical University, Mudanjiang 157011, Heilongjiang, China.
  • Qi Zhang
    Department of Gastroenterology, The Affiliated Hospital of Qingdao University, Qingdao, China.
  • Catherine Z Chen
    National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, USA.
  • Wei Zheng
    School of Computer Engineering, Jinling Institute of Technology, Nanjing, 211169, China. zhengwei@jit.edu.cn.
  • Ruili Huang
    National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Keywords

No keywords available for this article.