The performance of ChatGPT on medical image-based assessments and implications for medical education.

Journal: BMC medical education

Published Date: Aug 23, 2025

Abstract

BACKGROUND: Generative artificial intelligence (AI) tools like ChatGPT (OpenAI) have garnered significant attention for their potential in fields such as medical education; however, their performance of large language and vision models on medical test items involving images remains underexplored, limiting their broader educational utility. This study aims to evaluate the performance of GPT-4 and GPT-4 Omni (GPT-4o), accessed via the ChatGPT platform, on image-based United States Medical Licensing Examination (USMLE) sample items, to explore their implications for medical education.

Authors

Xiang Yang

Department of Neurosurgery, West China Hospital, Sichuan University, No. 37 Guo Xue Xiang Alley, Wu Hou Distract, Chengdu, Sichuan Province, 610037, China.
Wei Chen

Department of Urology, Zigong Fourth People's Hospital, Sichuan, China.

Keywords

Artificial Intelligence Clinical Competence Diagnostic Imaging Education, Medical Educational Measurement Generative Artificial Intelligence Humans Licensure, Medical United States

External Resources

View on PubMed Access via DOI PubMed (40849473)

The performance of ChatGPT on medical image-based assessments and implications for medical education.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

The performance of ChatGPT on medical image-based assessments and implications for medical education.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals