Chatbots in urology: accuracy, calibration, and comprehensibility; is DeepSeek taking over the throne?

Journal: BJU International

Published Date: Jul 31, 2025

Abstract

OBJECTIVE: To evaluate the accuracy, calibration error, readability, and understandability of widely used chatbots with objective measurements, using 35 questions derived from urology in-service examinations. The integration of large language models (LLMs) into healthcare has gained increasing attention, raising questions about their applications and limitations.

Authors

Omer Faruk Asker
School of Medicine, Marmara University, Istanbul, Turkey.

Muhammed Selim Recai
School of Medicine, Marmara University, Istanbul, Turkey.

Yunus Emre Genc
Department of Urology, School of Medicine, Marmara University, Istanbul, Turkey.

Kader Ada Dogan
Department of Urology, School of Medicine, Marmara University, Istanbul, Turkey.

Tarik Emre Sener

Bahadir Sahin


External Resources

PubMed ID: 40741907
