Machine learning models based on routine blood and biochemical test data for diagnosis of neurological diseases.

Journal: Scientific reports
Published Date:

Abstract

Globally, nervous system diseases are the leading cause of disability-adjusted life-years and the second leading cause of mortality in the world. Traditional diagnostic methods for nervous system diseases are expensive. So this study aimed to construct machine learning models using the convenient blood routine and biochemical detection data for diagnosis of nervous system diseases. After the data preprocessing, 25,794 healthy people and 7518 nervous system disease patients with the blood routine and biochemical detection data were utilized for our study. We selected logistic regression, random forest, support vector machine, eXtreme Gradient Boosting (XGBoost), and deep neural network to construct models. Finally, the SHAP algorithm was used to interpret models. The nervous system disease prediction model constructed by XGBoost possessed the best performance (AUC: 0.9782). And the most models of distinguishing various nervous system diseases also had good performance, the model performance of distinguishing neuromyelitis optica from other nervous system diseases was the best (AUC: 0.9095). The model interpretation by SHAP algorithm indicated features from biochemical detection made major contributions to predicting nervous system disease. The present study constructed multiple models using 52 features from the blood routine and biochemical detection data for diagnosis of various nervous system diseases. Meanwhile, distinct hematologic features of various nervous system diseases also were explored. This cost-effective work will benefit more people and assist in diagnosis and prevention of nervous system diseases.

Authors

  • Wanshan Ning
    Department of Bioinformatics and Systems Biology, Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.
  • Zhicheng Wang
    Department of Neurosurgery, National Regional Medical Center, Binhai Campus of the First Affiliated Hospital, Fujian Medical University, Fuzhou, Fujian, China.
  • Ying Gu
    Department of Radiation Oncology, Jinling Hospital, Nanjing, Jiangsu, 210002, China.
  • Lindan Huang
    Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China.
  • Shuai Liu
    Graduate School of Chinese Academy of Traditional Chinese Medicine, Beijing, China.
  • Qun Chen
    Shanghai United Imaging Healthcare Co., Ltd, Shanghai 201807, People's Republic of China.
  • Yunyun Yang
    Guangdong Provincial Key Laboratory of Chemical Measurement and Emergency Test Technology, Guangdong Provincial Engineering Research Center for Ambient Mass Spectrometry, Institute of Analysis, Guangdong Academy of Sciences (China National Analytical Center, Guangzhou), Guangzhou, 510070, China.
  • Guolin Hong
    Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China. xmhgl9899@xmu.edu.cn.