Chatbots in urology: accuracy, calibration, and comprehensibility; is DeepSeek taking over the throne?

Journal: BJU international
Published Date:

Abstract

OBJECTIVE: To evaluate widely used chatbots' accuracy, calibration error, readability, and understandability with objective measurements by 35 questions derived from urology in-service examinations, as the integration of large language models (LLMs) into healthcare has gained increasing attention, raising questions about their applications and limitations.

Authors

  • Omer Faruk Asker
    School of Medicine, Marmara University, Istanbul, Turkey.
  • Muhammed Selim Recai
    School of Medicine, Marmara University, Istanbul, Turkey.
  • Yunus Emre Genc
    Department of Urology, School of Medicine, Marmara University, Istanbul, Turkey.
  • Kader Ada Dogan
    Department of Urology, School of Medicine, Marmara University, Istanbul, Turkey.
  • Tarik Emre Sener
  • Bahadir Sahin

Keywords

No keywords available for this article.