The power of data mining in diagnosis of childhood pneumonia.

Journal: Journal of the Royal Society, Interface
Published Date:

Abstract

Childhood pneumonia is the leading cause of death of children under the age of 5 years globally. Diagnostic information on the presence of infection, severity and aetiology (bacterial versus viral) is crucial for appropriate treatment. However, the derivation of such information requires advanced equipment (such as X-rays) and clinical expertise to correctly assess observational clinical signs (such as chest indrawing); both of these are often unavailable in resource-constrained settings. In this study, these challenges were addressed through the development of a suite of data mining tools, facilitating automated diagnosis through quantifiable features. Findings were validated on a large dataset comprising 780 children diagnosed with pneumonia and 801 age-matched healthy controls. Pneumonia was identified via four quantifiable vital signs (98.2% sensitivity and 97.6% specificity). Moreover, it was shown that severity can be determined through a combination of three vital signs and two lung sounds (72.4% sensitivity and 82.2% specificity); addition of a conventional biomarker (C-reactive protein) further improved severity predictions (89.1% sensitivity and 81.3% specificity). Finally, we demonstrated that aetiology can be determined using three vital signs and a newly proposed biomarker (lipocalin-2) (81.8% sensitivity and 90.6% specificity). These results suggest that a suite of carefully designed machine learning tools can be used to support multi-faceted diagnosis of childhood pneumonia in resource-constrained settings, compensating for the shortage of expensive equipment and highly trained clinicians.

Authors

  • Elina Naydenova
    Department of Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK elina.naydenova@eng.ox.ac.uk.
  • Athanasios Tsanas
    Department of Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, UK.
  • Stephen Howie
    Child Survival Theme, Medical Research Council Unit, Serrekunda, The Gambia.
  • Climent Casals-Pascual
    Department of Clinical Microbiology, Centre de Diagnòstic Biomèdic (CDB), Hospital Clínic of Barcelona, Barcelona, Spain.
  • Maarten De Vos
    STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics-Department of Electrical Engineering (ESAT), KU Leuven, Leuven, Belgium. maarten.devos@kuleuven.be.