Evaluation of the performance of large language models in clinical decision-making in endodontics.

Journal: BMC oral health
PMID:

Abstract

BACKGROUND: Artificial intelligence (AI) chatbots are excellent at generating language. The growing use of generative AI large language models (LLMs) in healthcare and dentistry, including endodontics, raises questions about their accuracy. The potential of LLMs to assist clinicians' decision-making processes in endodontics is worth evaluating. This study aims to comparatively evaluate the answers provided by Google Bard, ChatGPT-3.5, and ChatGPT-4 to clinically relevant questions from the field of Endodontics.

Authors

  • Yağız Özbay
    Department of Endodontics, Faculty of Dentistry, Karabük University, Karabük, Türkiye. yagiz_ozbay@hotmail.com.
  • Deniz Erdoğan
    Private Dentist, Ankara, Türkiye.
  • Gözde Akbal Dinçer
    Department of Endodontics, Faculty of Dentistry, Okan University, İstanbul , Türkiye.