Diagnostic Performance of ChatGPT-4o and DeepSeek-3 Differential Diagnosis of Complex Oral Lesions: A Multimodal Imaging and Case Difficulty Analysis.

Journal: Oral diseases
Published Date:

Abstract

BACKGROUND: AI models like ChatGPT-4o and DeepSeek-3 show diagnostic promise, but their reliability in complex, image-based oral lesions remains unclear. This study aimed to evaluate and compare the diagnostic accuracy of ChatGPT-4o and DeepSeek-3 despite their differing modalities against oral medicine (OM) experts across varied lesion types and case difficulty levels.

Authors

  • Fatma E A Hassanein
    Oral Medicine, Periodontology, and Oral Diagnosis, Faculty of Dentistry, King Salman International University, El Tur, Egypt.
  • Ahmed El Barbary
    Oral Medicine and Periodontology, Faculty of Dentistry, Cairo University, Giza, Egypt.
  • Radwa R Hussein
    Oral Medicine and Periodontology, Ain Shams University, Cairo, Egypt.
  • Yousra Ahmed
    Prosthodontics Dentistry, Faculty of Dentistry, King Salman International University, El Tur, Egypt.
  • Jylan El-Guindy
    Prosthodontics Dentistry, Faculty of Dentistry, King Salman International University, El Tur, Egypt.
  • Susan Sarhan
    Oral Medicine and Periodontology, Ain Shams University in Egypt, Cairo, Egypt. susan@dent.asu.edu.eg.
  • Asmaa Abou-Bakr
    Oral Medicine and Periodontology, Faculty of Dentistry, Galala University, Suez, Egypt. Asmaa.AbdAlRaouf@gu.edu.eg.

Keywords

No keywords available for this article.