Magnitude and Impact of Hallucinations in Tabular Synthetic Health Data on Prognostic Machine Learning Models: Validation Study.

Journal: Journal of medical Internet research
Published Date:

Abstract

BACKGROUND: Generative artificial intelligence (AI) for tabular synthetic data generation (SDG) has significant potential to accelerate health care research and innovation. A critical limitation of generative AI, however, is hallucinations. Although this has been commonly observed in text-generating models, it may also occur in tabular SDG.

Authors

  • Lisa Pilgram
    School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada.
  • Samer El Kababji
    CHEO Research Institute, Children's Hospital of Eastern Ontario, Ottawa, ON, Canada.
  • Dan Liu
    Department of Bioengineering, Temple University, Philadelphia, PA, United States.
  • Khaled El Emam
    University of Ottawa, Ottawa, ON, Canada.

Keywords

No keywords available for this article.