Towards accurate differential diagnosis with large language models.

Journal: Nature
Published Date:

Abstract

A comprehensive differential diagnosis is a cornerstone of medical care that is often reached through an iterative process of interpretation that combines clinical history, physical examination, investigations and procedures. Interactive interfaces powered by large language models present new opportunities to assist and automate aspects of this process. Here we introduce the Articulate Medical Intelligence Explorer (AMIE), a large language model that is optimized for diagnostic reasoning, and evaluate its ability to generate a differential diagnosis alone or as an aid to clinicians. Twenty clinicians evaluated 302 challenging, real-world medical cases sourced from published case reports. Each case report was read by two clinicians, who were randomized to one of two assistive conditions: assistance from search engines and standard medical resources; or assistance from AMIE in addition to these tools. All clinicians provided a baseline, unassisted differential diagnosis prior to using the respective assistive tools. AMIE exhibited standalone performance that exceeded that of unassisted clinicians (top-10 accuracy 59.1% versus 33.6%, P = 0.04). Comparing the two assisted study arms, the differential diagnosis quality score was higher for clinicians assisted by AMIE (top-10 accuracy 51.7%) compared with clinicians without its assistance (36.1%; McNemar's test: 45.7, P < 0.01) and clinicians with search (44.4%; McNemar's test: 4.75, P = 0.03). Further, clinicians assisted by AMIE arrived at more comprehensive differential lists than those without assistance from AMIE. Our study suggests that AMIE has potential to improve clinicians' diagnostic reasoning and accuracy in challenging cases, meriting further real-world evaluation for its ability to empower physicians and widen patients' access to specialist-level expertise.

Authors

  • Daniel McDuff
  • Mike Schaekermann
    Google Health, Google LLC, Mountain View, California.
  • Tao Tu
    Google Research, Mountain View, CA, USA.
  • Anil Palepu
    Department of Health Sciences and Technology, Harvard-MIT, Cambridge, MA, USA.
  • Amy Wang
    From the Departments of Diagnostic Imaging (M.T.S., M.J., J.L.B., G.L.B., R.A.M.), Diagnostic Imaging (A.D.Y.), and Neurosurgery (M.J., R.A.M.), Warren Alpert School of Medicine at Brown University, Rhode Island Hospital, 593 Eddy St, APC 701, Providence, RI 02903; Department of Computer Science, Brown University, Providence, RI (J.V., M.P.D., Y.H.K., S.S.S., H.J.T., A.W., H.L.C.W., C.E., U.C.); and the Norman Prince Neuroscience Institute, Rhode Island Hospital, Providence, RI (M.J., R.A.M.).
  • Jake Garrison
  • Karan Singhal
    Google Research, Mountain View, CA, USA. karansinghal@google.com.
  • Yash Sharma
    Google Research, Mountain View, CA, USA.
  • Shekoofeh Azizi
  • Kavita Kulkarni
    Google Research, Mountain View, CA, USA.
  • Le Hou
    Department of Computer Science, Stony Brook University, Stony Brook, NY 11794, USA.
  • Yong Cheng
    Department of Urology, The Affiliated Hospital of Southwest Medical University, Luzhou, China.
  • Yun Liu
    Google Health, Palo Alto, CA USA.
  • S Sara Mahdavi
    Google Research, Mountain View, CA, USA.
  • Sushant Prakash
    Google Research, New York City, NY, USA.
  • Anupam Pathak
    Google Research, Mountain View, CA, USA.
  • Christopher Semturs
    Google Health, Google LLC, Mountain View, California.
  • Shwetak Patel
  • Dale R Webster
    Google Inc, Mountain View, California.
  • Ewa Dominowska
    Google Research, Seattle, WA, USA.
  • Juraj Gottweis
    Google Research, Mountain View, CA, USA.
  • Joelle Barral
    Google Research, Mountain View, CA, USA.
  • Katherine Chou
    Google Research, San Jose, CA, USA.
  • Greg S Corrado
    Google Health, Palo Alto, CA USA.
  • Yossi Matias
    Google Research, Google LLC, 1600 Amphitheatre Parkway, Mountain View, CA, USA.
  • Jake Sunshine
    Google Research, Seattle, WA, USA. jakesunshine@google.com.
  • Alan Karthikesalingam
    Department of Outcomes Research, St George's Vascular Institute, London, SW17 0QT, United Kingdom.
  • Vivek Natarajan
    Google, Mountain View, CA, USA.