The diagnostic and triage accuracy of the GPT-3 artificial intelligence model: an observational study.

Journal: The Lancet. Digital health
Published Date:

Abstract

BACKGROUND: Artificial intelligence (AI) applications in health care have been effective in many areas of medicine, but they are often trained for a single task using labelled data, making deployment and generalisability challenging. How well a general-purpose AI language model performs diagnosis and triage relative to physicians and laypeople is not well understood.

Authors

  • David M Levine
    Department of Biostatistics, University of Washington, School of Public Health, Seattle, WA, USA.
  • Rudraksh Tuwani
    Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
  • Benjamin Kompa
  • Amita Varma
    Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
  • Samuel G Finlayson
    Harvard Medical School, Boston, MA, United States.
  • Ateev Mehrotra
    Department of Health Care Policy, Harvard Medical School, Boston, Massachusetts.
  • Andrew Beam
    Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States of America; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, United States of America.