A machine learning algorithm to increase COVID-19 inpatient diagnostic capacity.

Journal: PloS one
Published Date:

Abstract

Worldwide, testing capacity for SARS-CoV-2 is limited and bottlenecks in the scale up of polymerase chain reaction (PCR-based testing exist. Our aim was to develop and evaluate a machine learning algorithm to diagnose COVID-19 in the inpatient setting. The algorithm was based on basic demographic and laboratory features to serve as a screening tool at hospitals where testing is scarce or unavailable. We used retrospectively collected data from the UCLA Health System in Los Angeles, California. We included all emergency room or inpatient cases receiving SARS-CoV-2 PCR testing who also had a set of ancillary laboratory features (n = 1,455) between 1 March 2020 and 24 May 2020. We tested seven machine learning models and used a combination of those models for the final diagnostic classification. In the test set (n = 392), our combined model had an area under the receiver operator curve of 0.91 (95% confidence interval 0.87-0.96). The model achieved a sensitivity of 0.93 (95% CI 0.85-0.98), specificity of 0.64 (95% CI 0.58-0.69). We found that our machine learning algorithm had excellent diagnostic metrics compared to SARS-CoV-2 PCR. This ensemble machine learning algorithm to diagnose COVID-19 has the potential to be used as a screening tool in hospital settings where PCR testing is scarce or unavailable.

Authors

  • David Goodman-Meza
    Division of Infectious Diseases, David Geffen School of Medicine at UCLA, Los Angeles, California, United States of America.
  • Akos Rudas
    Department of Computational Medicine, UCLA, Los Angeles, California, United States of America.
  • Jeffrey N Chiang
    Department of Computational Medicine, UCLA, Los Angeles, California, United States of America.
  • Paul C Adamson
    Division of Infectious Diseases, David Geffen School of Medicine at UCLA, Los Angeles, California, United States of America.
  • Joseph Ebinger
    Department of Cardiology, Cedars-Sinai Medical Center, Los Angeles, California, United States of America.
  • Nancy Sun
    Department of Cardiology, Cedars-Sinai Medical Center, Los Angeles, California, United States of America.
  • Patrick Botting
    Department of Cardiology, Cedars-Sinai Medical Center, Los Angeles, California, United States of America.
  • Jennifer A Fulcher
    Division of Infectious Diseases, Department of Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA.
  • Faysal G Saab
    Department of Medicine, David Geffen School of Medicine at UCLA, Los Angeles, California, United States of America.
  • Rachel Brook
    Department of Medicine, David Geffen School of Medicine at UCLA, Los Angeles, California, United States of America.
  • Eleazar Eskin
    1 Department of Computer Science, University of California, Los Angeles, California.
  • Ulzee An
    Department of Computer Science, UCLA, Los Angeles, California, United States of America.
  • Misagh Kordi
    Department of Computational Medicine, UCLA, Los Angeles, California, United States of America.
  • Brandon Jew
    Bioinformatics Interdepartmental Program, University of California, Los Angeles, CA, USA.
  • Brunilda Balliu
    Department of Computational Medicine, UCLA, Los Angeles, California, United States of America.
  • Zeyuan Chen
    Department of Computer Science, UCLA, Los Angeles, California, United States of America.
  • Brian L Hill
    Department of Computer Science, University of California, Los Angeles, CA, USA.
  • Elior Rahmani
    Department of Computer Science, UCLA, Los Angeles, California, United States of America.
  • Eran Halperin
    Departments of Computer Science and Biomathmatics, UCLA Henry Samueli School of Engineering and Applied Science.
  • Vladimir Manuel
    Faculty Practice Group, David Geffen School of Medicine at UCLA, Los Angeles, California, United States of America.