DeepFake electrocardiograms using generative adversarial networks are the beginning of the end for privacy issues in medicine.

Journal: Scientific reports
Published Date:

Abstract

Recent global developments underscore the prominent role big data have in modern medical science. But privacy issues constitute a prevalent problem for collecting and sharing data between researchers. However, synthetic data generated to represent real data carrying similar information and distribution may alleviate the privacy issue. In this study, we present generative adversarial networks (GANs) capable of generating realistic synthetic DeepFake 10-s 12-lead electrocardiograms (ECGs). We have developed and compared two methods, named WaveGAN* and Pulse2Pulse. We trained the GANs with 7,233 real normal ECGs to produce 121,977 DeepFake normal ECGs. By verifying the ECGs using a commercial ECG interpretation program (MUSE 12SL, GE Healthcare), we demonstrate that the Pulse2Pulse GAN was superior to the WaveGAN* to produce realistic ECGs. ECG intervals and amplitudes were similar between the DeepFake and real ECGs. Although these synthetic ECGs mimic the dataset used for creation, the ECGs are not linked to any individuals and may thus be used freely. The synthetic dataset will be available as open access for researchers at OSF.io and the DeepFake generator available at the Python Package Index (PyPI) for generating synthetic ECGs. In conclusion, we were able to generate realistic synthetic ECGs using generative adversarial neural networks on normal ECGs from two population studies, thereby addressing the relevant privacy issues in medical datasets.

Authors

  • Vajira Thambawita
    SimulaMet, Oslo, Norway.
  • Jonas L Isaksen
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Steven A Hicks
    SimulaMet, Oslo, Norway.
  • Jonas Ghouse
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Gustav Ahlberg
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Allan Linneberg
    Center for Clinical Research and Prevention, Copenhagen University Hospital, Bispebjerg and Frederiksberg, Copenhagen, Denmark.
  • Niels Grarup
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Christina Ellervik
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Morten Salling Olesen
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Torben Hansen
    Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
  • Claus Graff
    Aalborg University, 9220, Aalborg Ø, Denmark.
  • Niels-Henrik Holstein-Rathlou
    University of Copenhagen, 2200, Copenhagen N, Denmark.
  • Inga Strümke
    SimulaMet, 0167, Oslo, Norway.
  • Hugo L Hammer
    Department of Computer Science, Oslo Metropolitan University, Oslo, Norway.
  • Mary M Maleckar
    Simula Research Laboratory, 1364, Fornebu, Norway.
  • Pål Halvorsen
    Center for Digital Engineering Simula Metropolitan, Fornebu 1364, Norway.
  • Michael A Riegler
    SimulaMet, Oslo, Norway.
  • Jørgen K Kanters
    University of Copenhagen, 2200, Copenhagen N, Denmark.