A Comparison of LLMs for Use in Generating Synthetic Test Data for Automated Testing of a Patient-Focused, Survey-Based System.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium

Published Date: May 22, 2025

Abstract

In the context of a patient-focused, survey-based system, we demonstrated the potential of generative AI to create custom synthetic data using two large language models (GPT 3.5 and Flan T5-XL) in AWS and Azure environments. Although synthetically generating many test cases improved test effectiveness and efficiency, the work also involved technical and communication challenges, as well as the difficulty of balancing the desire for high utility and realism in the data against the available testing resources. Our recommendations range from defining and gaining consensus on evaluation metrics early in the process, because those metrics influence technical decisions such as persona creation and prompt engineering, to encouraging test teams to build flexible frameworks from the start.

Authors

Catherine L Anderson
Accenture Federal Services, Arlington, VA.

Marjorie R Willner
Accenture Federal Services, Arlington, VA.

Heather G Patsolic
Accenture Federal Services, Arlington, VA.

Larry Brem
National Cancer Institute, Rockville, MD.

Gelila Aboye
Accenture Federal Services, Arlington, VA.

Daniel Smolyak
University of Maryland, College Park, MD.

Kenyon Crowley
Accenture Federal Services, Arlington, VA.

Keywords

Artificial Intelligence; Humans; Natural Language Processing; Patient-Centered Care; Programming Languages; Surveys and Questionnaires

External Resources

View on PubMed (40417501)
