The performance of ChatGPT on medical image-based assessments and implications for medical education.

Journal: BMC Medical Education
Published Date:

Abstract

BACKGROUND: Generative artificial intelligence (AI) tools like ChatGPT (OpenAI) have garnered significant attention for their potential in fields such as medical education; however, the performance of large language and vision models on medical test items involving images remains underexplored, limiting their broader educational utility. This study aims to evaluate the performance of GPT-4 and GPT-4 Omni (GPT-4o), accessed via the ChatGPT platform, on image-based United States Medical Licensing Examination (USMLE) sample items and to explore the implications for medical education.

Authors

  • Xiang Yang
    Department of Neurosurgery, West China Hospital, Sichuan University, No. 37 Guo Xue Xiang Alley, Wu Hou District, Chengdu, Sichuan Province, 610037, China.
  • Wei Chen
    Department of Urology, Zigong Fourth People's Hospital, Sichuan, China.