Identification of a 10-species microbial signature of inflammatory bowel disease by machine learning and external validation.

Journal: Cell regeneration (London, England)
Published Date:

Abstract

Genetic and microbial factors influence inflammatory bowel disease (IBD), prompting our study on non-invasive biomarkers for enhanced diagnostic precision. Using the XGBoost algorithm and variable analysis and the published metadata, we developed the 10-species signature XGBoost classification model (XGB-IBD10). By using distinct species signatures and prior machine and deep learning models and employing standardization methods to ensure comparability between metagenomic and 16S sequencing data, we constructed classification models to assess the XGB-IBD10 precision and effectiveness. XGB-IBD10 achieved a notable accuracy of 0.8722 in testing samples. In addition, we generated metagenomic sequencing data from collected 181 stool samples to validate our findings, and the model reached an accuracy of 0.8066. The model's performance significantly improved when trained on high-quality data from the Chinese population. Furthermore, the microbiome-based model showed promise in predicting active IBD. Overall, this study identifies promising non-invasive biomarkers associated with IBD, which could greatly enhance diagnostic accuracy.

Authors

  • Shicheng Yu
    Guangzhou National Laboratory, Guangzhou, 510005, China.
  • Jun Li
    Department of Emergency, Zhuhai Integrated Traditional Chinese and Western Medicine Hospital, Zhuhai, 519020, Guangdong Province, China. quanshabai43@163.com.
  • Zhaofeng Ye
    MOE Key Laboratory of Bioinformatics, School of Medicine, Tsinghua University, Beijing, 100084, China.
  • Mengxian Zhang
    Department of Oncology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Xiaohua Guo
    College of Translation Studies, Xi'an Fanyi University, Xi'an 710105, China.
  • Xu Wang
    Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907.
  • Liansheng Liu
  • Yalong Wang
    School of Medicine, Shaan'xi Province, Xi'an Jiaotong University, Xi'an, 710061, China.
  • Xin Zhou
    School of Mechatronic Engineering, China University of Mining & Technology, Xuzhou 221116, China.
  • Wei Fu
    Department of Information Security, Naval University of Engineering, Wuhan, China.
  • Michael Q Zhang
    Department of Biological Sciences, Center for Systems Biology.
  • Ye-Guang Chen
    Guangzhou National Laboratory, Guangzhou, 510005, China. ygchen@tsinghua.edu.cn.

Keywords

No keywords available for this article.