Using different machine learning models to classify patients into mild and severe cases of COVID-19 based on multivariate blood testing.

Journal: Journal of medical virology
PMID:

Abstract

COVID-19 is a serious respiratory disease. The ever-increasing number of cases is causing heavier loads on the health service system. Using 38 blood test indicators on the first day of admission for the 422 patients diagnosed with COVID-19 (from January 2020 to June 2021) to construct different machine learning (ML) models to classify patients into either mild or severe cases of COVID-19. All models show good performance in the classification between COVID-19 patients into mild and severe disease. The area under the curve (AUC) of the random forest model is 0.89, the AUC of the naive Bayes model is 0.90, the AUC of the support vector machine model is 0.86, and the AUC of the KNN model is 0.78, the AUC of the Logistic regression model is 0.84, and the AUC of the artificial neural network model is 0.87, among which the naive Bayes model has the best performance. Different ML models can classify patients into mild and severe cases based on 38 blood test indicators taken on the first day of admission for patients diagnosed with COVID-19.

Authors

  • Rui-Kun Zhang
    Health Science Center, Shenzhen University, Shenzhen, China.
  • Qi Xiao
    Mallinckrodt Institute of Radiology, Washington University School of Medicine in St. Louis, St. Louis, MO 63110, United States.
  • Sheng-Lang Zhu
    Department of nephrology, Shenzhen Nanshan People's Hospital and The 6th Affiliated Hospital of Shenzhen University Health Science Center, Shenzhen, China.
  • Hai-Yan Lin
    Department of nephrology, Shenzhen Nanshan People's Hospital and The 6th Affiliated Hospital of Shenzhen University Health Science Center, Shenzhen, China.
  • Ming Tang
    Business School, Sichuan University, Chengdu 610064, China. tangming0716@163.com.