Comparing machine learning with case-control models to identify confirmed dengue cases.

Journal: PLoS neglected tropical diseases
PMID:

Abstract

In recent decades, the global incidence of dengue has increased. Affected countries have responded with more effective surveillance strategies to detect outbreaks early, monitor the trends, and implement prevention and control measures. We have applied newly developed machine learning approaches to identify laboratory-confirmed dengue cases from 4,894 emergency department patients with dengue-like illness (DLI) who received laboratory tests. Among them, 60.11% (2942 cases) were confirmed to have dengue. Using just four input variables [age, body temperature, white blood cells counts (WBCs) and platelets], not only the state-of-the-art deep neural network (DNN) prediction models but also the conventional decision tree (DT) and logistic regression (LR) models delivered performances with receiver operating characteristic (ROC) curves areas under curves (AUCs) of the ranging from 83.75% to 85.87% [for DT, DNN and LR: 84.60% ± 0.03%, 85.87% ± 0.54%, 83.75% ± 0.17%, respectively]. Subgroup analyses found all the models were very sensitive particularly in the pre-epidemic period. Pre-peak sensitivities (<35 weeks) were 92.6%, 92.9%, and 93.1% in DT, DNN, and LR respectively. Adjusted odds ratios examined with LR for low WBCs [≤ 3.2 (x103/μL)], fever (≥38°C), low platelet counts [< 100 (x103/μL)], and elderly (≥ 65 years) were 5.17 [95% confidence interval (CI): 3.96-6.76], 3.17 [95%CI: 2.74-3.66], 3.10 [95%CI: 2.44-3.94], and 1.77 [95%CI: 1.50-2.10], respectively. Our prediction models can readily be used in resource-poor countries where viral/serologic tests are inconvenient and can also be applied for real-time syndromic surveillance to monitor trends of dengue cases and even be integrated with mosquito/environment surveillance for early warning and immediate prevention/control measures. In other words, a local community hospital/clinic with an instrument of complete blood counts (including platelets) can provide a sentinel screening during outbreaks. In conclusion, the machine learning approach can facilitate medical and public health efforts to minimize the health threat of dengue epidemics. However, laboratory confirmation remains the primary goal of surveillance and outbreak investigation.

Authors

  • Tzong-Shiann Ho
    Department of Pediatrics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan, Republic of China.
  • Ting-Chia Weng
    Department of Occupational and Environmental Medicine, National Cheng Kung University Hospital, Tainan, Taiwan, Republic of China.
  • Jung-Der Wang
    Department of Public Health, College of Medicine, National Cheng Kung University, Tainan City, Taiwan.
  • Hsieh-Cheng Han
    Research Center for Applied Sciences, Academia Sinica, Taipei, Taiwan, Republic of China.
  • Hao-Chien Cheng
    Institute of Biomedical Electronics and Bioinformatics, College of Electrical Engineering & Computer Science, National Taiwan University, Taipei, Taiwan, Republic of China.
  • Chun-Chieh Yang
    Institute of Biomedical Electronics and Bioinformatics, College of Electrical Engineering & Computer Science, National Taiwan University, Taipei, Taiwan, Republic of China.
  • Chih-Hen Yu
    Department of Internal Medicine, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan, Republic of China.
  • Yen-Jung Liu
    Institute of Biomedical Electronics and Bioinformatics, College of Electrical Engineering & Computer Science, National Taiwan University, Taipei, Taiwan, Republic of China.
  • Chien Hsiang Hu
    Department of Medical Informatics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan, Republic of China.
  • Chun-Yu Huang
    Institute of Biomedical Electronics and Bioinformatics, College of Electrical Engineering & Computer Science, National Taiwan University, Taipei, Taiwan, Republic of China.
  • Ming-Hong Chen
    Department of Medical Informatics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan, Republic of China.
  • Chwan-Chuen King
    Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan, Republic of China.
  • Yen-Jen Oyang
    Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, 10617, Taiwan.
  • Ching-Chuan Liu
    Department of Pediatrics, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan, Republic of China.