Role of Artificial Intelligence in Surgical Training by Assessing GPT-4 and GPT-4o on the Japan Surgical Board Examination With Text-Only and Image-Accompanied Questions: Performance Evaluation Study.

Journal: JMIR medical education
Published Date:

Abstract

BACKGROUND: Artificial intelligence and large language models (LLMs)-particularly GPT-4 and GPT-4o-have demonstrated high correct-answer rates in medical examinations. GPT-4o has enhanced diagnostic capabilities, advanced image processing, and updated knowledge. Japanese surgeons face critical challenges, including a declining workforce, regional health care disparities, and work-hour-related challenges. Nonetheless, although LLMs could be beneficial in surgical education, no studies have yet assessed GPT-4o's surgical knowledge or its performance in the field of surgery.

Authors

  • Hiroki Maruyama
    Division of Gastroenterology & Hepatology Graduate School of Medical and Dental Sciences, Niigata University Niigata Japan.
  • Yoshitaka Toyama
    Department of Diagnostic Radiology, Tohoku University Hospital, 1-1 Seiryo-machi, Aoba-ku, Sendai, 980-8574, Japan. ytoyama0818@gmail.com.
  • Kentaro Takanami
    Department of Diagnostic Radiology, Tohoku University Hospital, 1-1 Seiryo-machi, Aoba-ku, Sendai, 980-8574, Japan.
  • Kei Takase
    Department of Diagnostic Radiology, Tohoku University Hospital, 1-1 Seiryo-machi, Aoba-ku, Sendai, 980-8574, Japan.
  • Takashi Kamei