Comparative performance of artificial intelligence models in rheumatology board-level questions: evaluating Google Gemini and ChatGPT-4o.

Journal: Clinical rheumatology
PMID:

Abstract

OBJECTIVES: This study evaluates the performance of two artificial intelligence (AI) models, ChatGPT-4o and Google Gemini, in answering rheumatology board-level questions, and compares their effectiveness, reliability, and applicability in clinical practice.

Authors

  • Enes Efe İş
    University of Health Sciences, Sisli Etfal Education and Training Hospital, Department of Physical Medicine and Rehabilitation - İstanbul, Turkey.
  • Ahmet Kivanc Menekseoglu
    Department of Physical Medicine and Rehabilitation, Kanuni Sultan Süleyman Training and Research Hospital, University of Health Sciences, Istanbul, Turkey.