Testing and Evaluation of Health Care Applications of Large Language Models: A Systematic Review.

Journal: JAMA
Published Date:

Abstract

IMPORTANCE: Large language models (LLMs) can assist in various health care activities, but current evaluation approaches may not adequately identify the most useful application areas.

Authors

  • Suhana Bedi
    Department of Biomedical Data Science, Stanford School of Medicine, Stanford, California.
  • Yutong Liu
    School of Economics and Management, Communication University of China, Beijing 100024, China.
  • Lucy Orr-Ewing
    Clinical Excellence Research Center, Stanford University, Stanford, California.
  • Dev Dash
    Department of Emergency Medicine, Stanford University, Stanford, California.
  • Sanmi Koyejo
    Department of Computer Science, University of Illinois at Urbana-Champaign, Champaign, Illinois.
  • Alison Callahan
    Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA 94305.
  • Jason A Fries
    Department of Computer Science, Stanford University, Stanford, CA, 94305, USA. jason-fries@stanford.edu.
  • Michael Wornow
    Department of Computer Science, Stanford University, Stanford, CA, USA. Electronic address: mwornow@stanford.edu.
  • Akshay Swaminathan
    Stanford University School of Medicine, Stanford, CA, United States.
  • Lisa Soleymani Lehmann
    Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.
  • Hyo Jung Hong
    Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania.
  • Mehr Kashyap
    Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, California, USA.
  • Akash R Chaurasia
    Center for Biomedical Informatics Research, Stanford University, Stanford, California.
  • Nirav R Shah
    Clinical Excellence Research Center, Stanford University, Stanford, California (N.R.S.).
  • Karandeep Singh
    Department of Internal Medicine and School of Information, University of Michigan, Ann Arbor, Michigan.
  • Troy Tazbaz
    US Food and Drug Administration, Silver Spring, Maryland.
  • Arnold Milstein
    Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA, USA.
  • Michael A Pfeffer
    Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, United States.
  • Nigam H Shah
    Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA.