Comparing Diagnostic Accuracy of Clinical Professionals and Large Language Models: Systematic Review and Meta-Analysis.

Journal: JMIR medical informatics
PMID:

Abstract

BACKGROUND: With the rapid development of artificial intelligence (AI) technology, especially generative AI, large language models (LLMs) have shown great potential in the medical field. Through massive medical data training, it can understand complex medical texts and can quickly analyze medical records and provide health counseling and diagnostic advice directly, especially in rare diseases. However, no study has yet compared and extensively discussed the diagnostic performance of LLMs with that of physicians.

Authors

  • Guxue Shan
    Nanjing Drum Tower Hospital Clinical College of Nanjing University of Chinese Medicine, Nanjing, China.
  • Xiaonan Chen
    School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200241, PR China.
  • Chen Wang
    Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.
  • Li Liu
    Metanotitia Inc., Shenzhen, China.
  • Yuanjing Gu
    Department of Emergency, Nanjing Drum Tower Hospital, Nanjing, China.
  • Huiping Jiang
  • Tingqi Shi