Gut microbiome-based supervised machine learning for clinical diagnosis of inflammatory bowel diseases.

Journal: American journal of physiology. Gastrointestinal and liver physiology
Published Date:

Abstract

Despite the availability of various diagnostic tests for inflammatory bowel diseases (IBD), misdiagnosis of IBD occurs frequently, and thus, there is a clinical need to further improve the diagnosis of IBD. As gut dysbiosis is reported in patients with IBD, we hypothesized that supervised machine learning (ML) could be used to analyze gut microbiome data for predictive diagnostics of IBD. To test our hypothesis, fecal 16S metagenomic data of 729 subjects with IBD and 700 subjects without IBD from the American Gut Project were analyzed using five different ML algorithms. Fifty differential bacterial taxa were identified [linear discriminant analysis effect size (LEfSe): linear discriminant analysis (LDA) score > 3] between the IBD and non-IBD groups, and ML classifications trained with these taxonomic features using random forest (RF) achieved a testing area under the receiver operating characteristic curves (AUC) of ∼0.80. Next, we tested if operational taxonomic units (OTUs), instead of bacterial taxa, could be used as ML features for diagnostic classification of IBD. Top 500 high-variance OTUs were used for ML training, and an improved testing AUC of ∼0.82 (RF) was achieved. Lastly, we tested if supervised ML could be used for differentiating Crohn's disease (CD) and ulcerative colitis (UC). Using 331 CD and 141 UC samples, 117 differential bacterial taxa (LEfSe: LDA score > 3) were identified, and the RF model trained with differential taxonomic features or high-variance OTU features achieved a testing AUC > 0.90. In summary, our study demonstrates the promising potential of artificial intelligence via supervised ML modeling for predictive diagnostics of IBD using gut microbiome data. Our study demonstrates the promising potential of artificial intelligence via supervised machine learning modeling for predictive diagnostics of different types of inflammatory bowel diseases using fecal gut microbiome data.

Authors

  • Ishan Manandhar
    Center for Hypertension and Precision Medicine, Department of Physiology and Pharmacology, University of Toledo College of Medicine and Life Sciences, Toledo, OH, USA.
  • Ahmad Alimadadi
    Center for Hypertension and Precision Medicine, Program in Physiological Genomics, Department of Physiology and Pharmacology, University of Toledo College of Medicine and Life Sciences, Toledo, Ohio.
  • Sachin Aryal
    Center for Hypertension and Precision Medicine, Department of Physiology and Pharmacology, University of Toledo College of Medicine and Life Sciences, Toledo, OH, USA.
  • Patricia B Munroe
    Center for Hypertension and Precision Medicine, Program in Physiological Genomics, Department of Physiology and Pharmacology, University of Toledo College of Medicine and Life Sciences, Toledo, Ohio.
  • Bina Joe
    Center for Hypertension and Precision Medicine, Department of Physiology and Pharmacology, University of Toledo College of Medicine and Life Sciences, Toledo, OH, USA.
  • Xi Cheng
    Genes, Cognition, and Psychosis Program, National Institute of Mental Health, National Institutes of HealthBethesda, MD, USA; The Lieber Institute for Brain DevelopmentBaltimore, MD, USA; Bioinformatics and Computational Biosciences Branch, Office of Cyber Infrastructure and Computational Biology (OCICB), National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of HealthRockville, MD, USA.