Accuracy of LLMs in medical education: evidence from a concordance test with medical teacher.
Journal:
BMC Medical Education
PMID:
40140805
Abstract
BACKGROUND: The use of generative AI in medical education is increasing at an unprecedented rate, and the accuracy of these models needs to be assessed to ensure patient safety. This study assesses the accuracy of ChatGPT, Gemini, and Copilot in answering multiple-choice questions (MCQs) compared with the answers of a qualified medical teacher.