Deep continual learning for medical call incidents text classification under the presence of dataset shifts.

Journal: Computers in biology and medicine
PMID:

Abstract

The aim of this work is to develop and evaluate a deep classifier that can effectively prioritize Emergency Medical Call Incidents (EMCI) according to their life-threatening level under the presence of dataset shifts. We utilized a dataset consisting of 1982746 independent EMCI instances obtained from the Health Services Department of the Region of Valencia (Spain), with a time span from 2009 to 2019 (excluding 2013). The dataset includes free text dispatcher observations recorded during the call, as well as a binary variable indicating whether the event was life-threatening. To evaluate the presence of dataset shifts, we examined prior probability shifts, covariate shifts, and concept shifts. Subsequently, we designed and implemented four deep Continual Learning (CL) strategies-cumulative learning, continual fine-tuning, experience replay, and synaptic intelligence-alongside three deep CL baselines-joint training, static approach, and single fine-tuning-based on DistilBERT models. Our results demonstrated evidence of prior probability shifts, covariate shifts, and concept shifts in the data. Applying CL techniques had a statistically significant (α=0.05) positive impact on both backward and forward knowledge transfer, as measured by the F1-score, compared to non-continual approaches. We can argue that the utilization of CL techniques in the context of EMCI is effective in adapting deep learning classifiers to changes in data distributions, thereby maintaining the stability of model performance over time. To our knowledge, this study represents the first exploration of a CL approach using real EMCI data.

Authors

  • Pablo Ferri
    Biomedical Data Science Laboratory (BDSLab), Instituto de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politècnica de València (UPV), Valencia, Spain. Electronic address: pabferb2@upv.es.
  • Vincenzo Lomonaco
    Department of Computer Science and Engineering, University of Bologna, Italy. Electronic address: vincenzo.lomonaco@unibo.it.
  • Lucia C Passaro
    Department of Computer Science, University of Pisa (Unipi), Pisa, Italy. Electronic address: lucia.passaro@unipi.it.
  • Antonio Félix-De Castro
    Conselleria de Sanitat Universal i Salut Pública, Generalitat Valenciana (GVA), Valencia, Spain. Electronic address: felix_antdec@gva.es.
  • Purificación Sánchez-Cuesta
    Conselleria de Sanitat Universal i Salut Pública, Generalitat Valenciana (GVA), Valencia, Spain. Electronic address: sanchez_pur@gva.es.
  • Carlos Sáez
  • Juan M García-Gómez
    Biomedical Data Science Lab. Instituto de Aplicaciones de las Tecnologías de la Información y de las Comunicaciones Avanzadas (ITACA), Universitat Politécnica de Valéncia, Camino de Vera s/n, Valencia 46022, Spain. Electronic address: juanmig@ibime.upv.es.