Synthetic neurosurgical data generation with generative adversarial networks and large language models:an investigation on fidelity, utility, and privacy.

Journal: Neurosurgical focus
Published Date:

Abstract

OBJECTIVE: Use of neurosurgical data for clinical research and machine learning (ML) model development is often limited by data availability, sample sizes, and regulatory constraints. Synthetic data offer a potential solution to challenges associated with accessing, sharing, and using real-world data (RWD). The aim of this study was to evaluate the capability of generating synthetic neurosurgical data with a generative adversarial network and large language model (LLM) to augment RWD, perform secondary analyses in place of RWD, and train an ML model to predict postoperative outcomes.

Authors

  • Austin A Barr
    1Cumming School of Medicine, University of Calgary, Alberta.
  • Eddie Guo
    Cumming School of Medicine, University of Calgary, Calgary, Alberta, Canada.
  • Brij S Karmur
    2Department of Clinical Neurosciences, Division of Neurosurgery, University of Calgary, Alberta, Canada.
  • Emre Sezgin
    The Abigail Wexner Research Institute, Nationwide Children's Hospital, Columbus, OH, United States.