Assessment for antibiotic resistance in : A practical and interpretable machine learning model based on genome-wide genetic variation.

Journal: Virulence
PMID:

Abstract

() antibiotic resistance poses a global health threat. Accurate identification of antibiotic resistant strains is essential for the control of infection. In the present study, our goal is to leverage the whole-genome data of to develop practical and interpretable machine learning (ML) models for comprehensive antibiotic resistance assessment. A total of 296 isolates with genome-wide data were downloaded from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and the National Center for Biotechnology Information (NCBI) databases. By training ML models on feature sets of single nucleotide polymorphisms from SNP calling (SNPs-1), antibiotic-resistance SNP annotated by the Comprehensive Antibiotic Resistance Database (SNPs-2), gene presence or absence (GPA), we generated predictive models for four antibiotics and multidrug-resistance (MDR). Among them, the models that combined SNPs-1, SNPs-2, and GPA data demonstrated the best performance, with the eXtreme Gradient Boosting (XGBoost) consistently outperforming others. And then we utilized the SHapley Additive exPlanations (SHAP) method to interpret the ML models. Furthermore, a free web application for the MDR model was deployed to the GitHub repository (https://H.pylori/MDR/App/). Our study demonstrated the promise of employing whole-genome data in conjunction with ML algorithms to forecast antibiotic resistance. In the future, the application of this approach for predicting antibiotic resistance would hold the potential to mitigate the empiric administration.

Authors

  • Yingying Wang
    Department of Thyroid Surgery, China-Japan Union Hospital of Jilin University, Jilin University, Changchun, China.
  • Shuwen Zheng
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Rui Guo
    College of Chemistry&Chemical Engineering, Xiamen University, Xiamen 361005, China.
  • Yanke Li
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Honghao Yin
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Xunan Qiu
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Jijun Chen
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Chuxuan Ni
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.
  • Yuan Yuan
    Department of Geriatrics, Beijing Jishuitan Hospital, Capital Medical University, Beijing, China.
  • Yuehua Gong
    Tumor Etiology and Screening Department of Cancer Institute and General Surgery, The First Hospital of China Medical University, Shenyang, China.