Differentiation of intestinal tuberculosis and Crohn's disease through an explainable machine learning method.

Journal: Scientific reports
PMID:

Abstract

Differentiation between Crohn's disease and intestinal tuberculosis is difficult but crucial for medical decisions. This study aims to develop an effective framework to distinguish these two diseases through an explainable machine learning (ML) model. After feature selection, a total of nine variables are extracted, including intestinal surgery, abdominal, bloody stool, PPD, knot, ESAT-6, CFP-10, intestinal dilatation and comb sign. Besides, we compared the predictive performance of the ML methods with traditional statistical methods. This work also provides insights into the ML model's outcome through the SHAP method for the first time. A cohort consisting of 200 patients' data (CD = 160, ITB = 40) is used in training and validating models. Results illustrate that the XGBoost algorithm outperforms other classifiers in terms of area under the receiver operating characteristic curve (AUC), sensitivity, specificity, precision and Matthews correlation coefficient (MCC), yielding values of 0.891, 0.813, 0.969, 0.867 and 0.801 respectively. More importantly, the prediction outcomes of XGBoost can be effectively explained through the SHAP method. The proposed framework proves that the effectiveness of distinguishing CD from ITB through interpretable machine learning, which can obtain a global explanation but also an explanation for individual patients.

Authors

  • Futian Weng
    School of Mathematics and Statistics, Central South University, Changsha, 410083, China.
  • Yu Meng
    Rehabilitation Medicine Center, Department of Radiology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou 310014, Zhejiang, China.
  • Fanggen Lu
    The Gastroenterology Department of Second Xiangya Hospital, Central South University, Changsha, 410011, China.
  • Yuying Wang
    Data Mining Research Center, Xiamen University, Xiamen, 361005, Fujian, China.
  • Weiwei Wang
  • Long Xu
    Solar Activity Prediction Center, National Astronomical Observatories, Chinese Academy of Sciences, Beijing 100012, China.
  • Dongsheng Cheng
    School of Software Engineering, Shenzhen Institute of Information Technology, Shenzhen, 518172, China.
  • Jianping Zhu
    National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, Fujian, China. xmjpzhu@163.com.