Accuracy of LLMs in medical education: evidence from a concordance test with medical teacher.

Journal: BMC medical education
PMID:

Abstract

BACKGROUND: There is an unprecedented increase in the use of Generative AI in medical education. There is a need to assess these models' accuracy to ensure patient safety. This study assesses the accuracy of ChatGPT, Gemini, and Copilot in answering multiple-choice questions (MCQs) compared to a qualified medical teacher.

Authors

  • Vinaytosh Mishra
    Datta Meghe Institute of Higher Education & Research, Nagpur, Maharashtra, India. dr.vinaytosh@gmu.ac.ae.
  • Yotam Lurie
    Ben-Gurion University of the Negev, Be'er Sheva, Israel.
  • Shlomo Mark
    Shamoon College of Engineering, Ashdod, Israel.