Can ChatGPT-4 evaluate whether a differential diagnosis list contains the correct diagnosis as accurately as a physician?

Journal: Diagnosis (Berlin, Germany)

Published Date: Mar 12, 2024

Abstract

OBJECTIVES: The potential of artificial intelligence (AI) chatbots, particularly the fourth-generation chat generative pretrained transformer (ChatGPT-4), in assisting with medical diagnosis is an emerging research area. While there has been significant emphasis on creating lists of differential diagnoses, it is not yet clear how well AI chatbots can evaluate whether the final diagnosis is included in these lists. This short communication aimed to assess the accuracy of ChatGPT-4 in evaluating lists of differential diagnoses compared with medical professionals' assessments.

Authors

Kazuya Mizuta

Department of Diagnostic and Generalist Medicine, Dokkyo Medical University, Shimotsuga-gun, Japan.

Takanobu Hirosawa

Department of Diagnostic and Generalist Medicine, Dokkyo Medical University, Tochigi 321-0293, Japan.

Yukinori Harada

Department of General Internal Medicine, Nagano Chuo Hospital, Nagano 380-0814, Japan.

Taro Shimizu

Department of Diagnostic and Generalist Medicine, Dokkyo Medical University, Tochigi, Japan.

Keywords

Artificial Intelligence; Diagnosis, Differential; Humans; Physicians

External Resources

PubMed ID: 38465399
