Evaluation of Chat Generative Pre-trained Transformer and Microsoft Copilot Performance on the American Society of Surgery of the Hand Self-Assessment Examinations.

Journal: Journal of Hand Surgery Global Online

Abstract

PURPOSE: Artificial intelligence advancements have the potential to transform medical education and patient care. The increasing popularity of large language models has raised important questions regarding their accuracy and agreement with human users. The purpose of this study was to evaluate the performance of Chat Generative Pre-Trained Transformer (ChatGPT), versions 3.5 and 4, as well as Microsoft Copilot, which is powered by ChatGPT-4, on self-assessment examination questions for hand surgery and compare results between versions.

Authors

  • Taylor R Rakauskas
    College of Medicine, Florida Atlantic University, Boca Raton, FL.
  • Antonio Da Costa
    College of Medicine, Florida Atlantic University, Boca Raton, FL.
  • Camberly Moriconi
    College of Medicine, Florida Atlantic University, Boca Raton, FL.
  • Gurnoor Gill
    College of Medicine, Florida Atlantic University, Boca Raton, FL.
  • Jeffrey W Kwong
    Department of Orthopaedic Surgery, University of California San Francisco, San Francisco, CA.
  • Nicolas Lee
    Department of Orthopaedic Surgery, University of California San Francisco, San Francisco, CA.
