Comparative evaluation of AI platforms "Google Gemini 2.5 Flash, Google Gemini 2.0 Flash, DeepSeek V3 and ChatGPT 4o" in solving multiple-choice questions from different subtopics of anatomy.

Journal: Surgical and radiologic anatomy : SRA
Published Date:

Abstract

PURPOSE: The rise of artificial intelligence (AI) based large language models (LLMs) had a profound impact on medical education. Given the widespread use of multiple-choice questions (MCQs) in anatomy education, it is likely that such queries are commonly directed to AI tools. The current study compared the accuracy level of different AI platforms for solving MCQs from various subtopics in Anatomy.

Authors

  • Anjali Singal
    Deptartment of Anatomy, All India Institute of Medical Sciences, Bathinda, India.
  • Swati Goyal
    Mass General Brigham Data Science Office, Boston, MA, United States of America.