A data science approach for early-stage prediction of Patient's susceptibility to acute side effects of advanced radiotherapy.

Journal: Computers in biology and medicine
Published Date:

Abstract

The prediction by classification of side effects incidence in a given medical treatment is a common challenge in medical research. Machine Learning (ML) methods are widely used in the areas of risk prediction and classification. The primary objective of such algorithms is to use several features to predict dichotomous responses (e.g., disease positive/negative). Similar to statistical inference modelling, ML modelling is subject to the class imbalance problem and is affected by the majority class, increasing the false-negative rate. In this study, seventy-nine ML models were built and evaluated to classify approximately 2000 participants from 26 hospitals in eight different countries into two groups of radiotherapy (RT) side effects incidence based on recorded observations from the international study of RT related toxicity "REQUITE". We also examined the effect of sampling techniques and cost-sensitive learning methods on the models when dealing with class imbalance. The combinations of such techniques used had a significant impact on the classification. They resulted in an improvement in incidence status prediction by shifting classifiers' attention to the minority group. The best classification model for RT acute toxicity prediction was identified based on domain experts' success criteria. The Area Under Receiver Operator Characteristic curve of the models tested with an isolated dataset ranged from 0.50 to 0.77. The scale of improved results is promising and will guide further development of models to predict RT acute toxicities. One model was optimised and found to be beneficial to identify patients who are at risk of developing acute RT early-stage toxicities as a result of undergoing breast RT ensuring relevant treatment interventions can be appropriately targeted. The design of the approach presented in this paper resulted in producing a preclinical-valid prediction model. The study was developed by a multi-disciplinary collaboration of data scientists, medical physicists, oncologists and surgeons in the UK Radiotherapy Machine Learning Network.

Authors

  • Mahmoud Aldraimli
    The Health Innovation Ecosystem, University of Westminster, London, UK. Electronic address: w1654353@my.westminster.ac.uk.
  • Daniele Soria
  • Diana Grishchuck
    Imperial College Healthcare NHS Trust, London, UK.
  • Samuel Ingram
    Division of Cancer Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, UK.
  • Robert Lyon
    Department of Computer Science, Edge Hill University, Ormskirk, Lancashire, UK.
  • Anil Mistry
    Guy's and St Thomas' NHS Foundation Trust, London, UK.
  • Jorge Oliveira
  • Robert Samuel
    University of Leeds, Leeds Cancer Centre, St. James's University Hospital, Leeds, UK.
  • Leila E A Shelley
    Edinburgh Cancer Centre, Western General Hospital, Crewe Road South, Edinburgh, UK.
  • Sarah Osman
    Centre for Cancer Research and Cell Biology, Queens' University, Belfast, UK.
  • Miriam V Dwek
    School of Life Sciences, University of Westminster, London, UK.
  • David Azria
    Institut du Cancer de Montpellier, Department of Radiation Oncology, Montpellier, France.
  • Jenny Chang-Claude
    Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany.
  • Sara Gutiérrez-Enríquez
    Vall d'Hebron Institute of Oncology, Barcelona, Spain.
  • Maria Carmen De Santis
    Dept of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy.
  • Barry S Rosenstein
    Department of Radiation Oncology and the Department of Genetics and Genomic Sciences, Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
  • Dirk de Ruysscher
    a Department of Radiation Oncology (MAASTRO), GROW - School for Oncology and Developmental Biology , Maastricht University Medical Centre , Maastricht , The Netherlands.
  • Elena Sperk
    Department of Radiation Oncology, University Medical Center Mannheim, Medical Faculty Mannheim, Heidelberg University, Germany.
  • R Paul Symonds
    Department of Oncology, Leicester Royal Infirmary, UK.
  • Hilary Stobart
    Independent Cancer Patients' Voice, London, UK.
  • Ana Vega
    Fundación Publica Galega Medicina Xenomica, Santiago de Compostela, Spain.
  • Liv Veldeman
    Department of Basic Medical Sciences, University Hospital Ghent, Belgium.
  • Adam Webb
    Department of Genetics and Genome Biology, University of Leicester, UK.
  • Christopher J Talbot
    Cancer Research Centre, University of Leicester, Leicester, UK.
  • Catharine M West
    Institute of Cancer Sciences, Christie Hospital, Wilmslow Road, Manchester, UK.
  • Tim Rattay
    Department of Genetics and Genome Biology, University of Leicester Cancer Research Centre, Leicester, UK.
  • Thierry J Chaussalet
    The Health Innovation Ecosystem, University of Westminster, London, UK.