COPD phenotypes and machine learning cluster analysis: A systematic review and future research agenda.

Journal: Respiratory medicine
Published Date:

Abstract

Chronic Obstructive Pulmonary Disease (COPD) is a highly heterogeneous condition projected to become the third leading cause of death worldwide by 2030. To better characterize this condition, clinicians have classified patients sharing certain symptomatic characteristics, such as symptom intensity and history of exacerbations, into distinct phenotypes. In recent years, the growing use of machine learning algorithms, and cluster analysis in particular, has promised to advance this classification through the integration of additional patient characteristics, including comorbidities, biomarkers, and genomic information. This combination would allow researchers to more reliably identify new COPD phenotypes, as well as better characterize existing ones, with the aim of improving diagnosis and developing novel treatments. Here, we systematically review the last decade of research progress, which uses cluster analysis to identify COPD phenotypes. Collectively, we provide a systematized account of the extant evidence, describe the strengths and weaknesses of the main methods used, identify gaps in the literature, and suggest recommendations for future research.

Authors

  • Vasilis Nikolaou
    Surrey Business School, University of Surrey, Guildford, GU2 7HX, UK. Electronic address: v.nikolaou@surrey.ac.uk.
  • Sebastiano Massaro
    Surrey Business School, University of Surrey, Guildford, GU2 7HX, UK; The Organizational Neuroscience Laboratory, London, WC1N 3AX, UK.
  • Masoud Fakhimi
    Surrey Business School, University of Surrey, Guildford, GU2 7HX, UK.
  • Lampros Stergioulas
    Surrey Business School, University of Surrey, Guildford, GU2 7HX, UK.
  • David Price
    Observational and Pragmatic Research Institute, Singapore, Singapore; Centre of Academic Primary Care, Division of Applied Health Sciences, University of Aberdeen, Aberdeen, UK.