Clinical Knowledge and Reasoning Abilities of AI Large Language Models in Anesthesiology: A Comparative Study on the American Board of Anesthesiology Examination.

Journal: Anesthesia and analgesia
Published Date:

Abstract

BACKGROUND: Over the past decade, artificial intelligence (AI) has expanded significantly with increased adoption across various industries, including medicine. Recently, AI-based large language models such as Generative Pretrained Transformer-3 (GPT-3), Bard, and Generative Pretrained Transformer-3 (GPT-4) have demonstrated remarkable language capabilities. While previous studies have explored their potential in general medical knowledge tasks, here we assess their clinical knowledge and reasoning abilities in a specialized medical context.

Authors

  • Mirana C Angel
    From the Department of Computer Science, University of California Irvine, Irvine, California.
  • Joseph B Rinehart
    Department of Anesthesiology & Perioperative Care, University of California Irvine, Irvine, California.
  • Maxime P Cannesson
    Department of Anesthesiology & Perioperative Medicine, University of California Los Angeles, Los Angeles, California.
  • Pierre Baldi
    Department of Computer Science, Department of Biological Chemistry, University of California-Irvine, Irvine, CA 92697, USA.