Can ChatGPT-4 evaluate whether a differential diagnosis list contains the correct diagnosis as accurately as a physician?

Journal: Diagnosis (Berlin, Germany)
Published Date:

Abstract

OBJECTIVES: The potential of artificial intelligence (AI) chatbots, particularly the fourth-generation chat generative pretrained transformer (ChatGPT-4), in assisting with medical diagnosis is an emerging research area. While there has been significant emphasis on creating lists of differential diagnoses, it is not yet clear how well AI chatbots can evaluate whether the final diagnosis is included in these lists. This short communication aimed to assess the accuracy of ChatGPT-4 in evaluating lists of differential diagnosis compared to medical professionals' assessments.

Authors

  • Kazuya Mizuta
    Department of Diagnostic and Generalist Medicine, Dokkyo Medical University 12756 , Simotsuga-gun, Japan.
  • Takanobu Hirosawa
    Department of Diagnostic and Generalist Medicine, Dokkyo Medical University, Tochigi 321-0293, Japan.
  • Yukinori Harada
    Department of General Internal Medicine, Nagano Chuo Hospital, Nagano 380-0814, Japan.
  • Taro Shimizu
    Department of Diagnostic and Generalist Medicine Dokkyo Medical University Tochigi Japan.