A retrospective study of deep learning generalization across two centers and multiple models of X-ray devices using COVID-19 chest-X rays.

Journal: Scientific reports
Published Date:

Abstract

Generalization of deep learning (DL) algorithms is critical for the secure implementation of computer-aided diagnosis systems in clinical practice. However, broad generalization remains to be a challenge in machine learning. This research aims to identify and study potential factors that can affect the internal validation and generalization of DL networks, namely the institution where the images come from, the image processing applied by the X-ray device, and the type of response function of the X-ray device. For these purposes, a pre-trained convolutional neural network (CNN) (VGG16) was trained three times for classifying COVID-19 and control chest radiographs with the same hyperparameters, but using different combinations of data acquired in two institutions by three different X-ray device manufacturers. Regarding internal validation, the addition of images from an external institution to the training set did not modify the algorithm's internal performance, however, the inclusion of images acquired by a device from a different manufacturer decreased the performance up to 8% (p < 0.05). In contrast, generalization across institutions and X-ray devices with the same type of response function was achieved. Nonetheless, generalization was not observed across devices with different types of response function. This factor was the key impediment to achieving broad generalization in our research, followed by the device's image-processing and the inter-institutional differences, which both reduced generalization performance to 18.9% (p < 0.05), and 9.8% (p < 0.05), respectively. Finally, clustering analysis with features extracted by the CNN was performed, revealing a substantial dependence of feature values extracted by the pre-trained CNN on the X-ray device which acquired the images.

Authors

  • Pablo Menéndez Fernández-Miranda
    Departamento de Radiología, Hospital Universitario Rey Juan Carlos, Calle Gladiolo, s/n, 28933, Móstoles, Spain.
  • Enrique Marqués Fraguela
    Departamento de Radiofísica y Protección Radiológica, Hospital Universitario Marqués de Valdecilla, Avenida de Valdecilla s/n, Santander, Spain.
  • Marta Álvarez de Linera-Alperi
    Departamento de Otorrinolaringología, Clínica Universidad de Navarra, Calle del Marquesado de Santa Marta, 1, 28027, Madrid, Spain.
  • Miriam Cobo
    Advanced Computing and e-Science Research Group, Institute of Physics of Cantabria (IFCA), CSIC - UC, 39005, Santander, Cantabria, Spain. cobocano@ifca.unican.es.
  • Amaia Pérez Del Barrio
    Servicio de Radiología, Complejo Hospitalario de Navarra, C. de Irunlarrea, 3, 31008, Pamplona, Spain.
  • David Rodríguez González
    Advanced Computing and E-Science Research Group, Institute of Physics of Cantabria, Grupo de Computación y E-Ciencia, CSIC-UC, IFCA-CSIC, Avenida de los Castros s/n, 39005, Santander, Spain.
  • José A Vega
    Departamento de Morfología y Biología Celular, Universidad de Oviedo, Oviedo, Spain.
  • Lara Lloret Iglesias
    Advanced Computing and E-Science Research Group, Institute of Physics of Cantabria, Grupo de Computación y E-Ciencia, CSIC-UC, IFCA-CSIC, Avenida de los Castros s/n, 39005, Santander, Spain.