Comparing Diagnostic Accuracy of Clinical Professionals and Large Language Models: Systematic Review and Meta-Analysis.

Journal: JMIR medical informatics

PMID: 40279517

Abstract

BACKGROUND: With the rapid development of artificial intelligence (AI) technology, especially generative AI, large language models (LLMs) have shown great potential in the medical field. Through massive medical data training, it can understand complex medical texts and can quickly analyze medical records and provide health counseling and diagnostic advice directly, especially in rare diseases. However, no study has yet compared and extensively discussed the diagnostic performance of LLMs with that of physicians.

Authors

Guxue Shan

Nanjing Drum Tower Hospital Clinical College of Nanjing University of Chinese Medicine, Nanjing, China.
Xiaonan Chen

School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, 200241, PR China.
Chen Wang

Department of Cardiovascular Surgery, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China.
Li Liu

Metanotitia Inc., Shenzhen, China.
Yuanjing Gu

Department of Emergency, Nanjing Drum Tower Hospital, Nanjing, China.
Huiping Jiang
Tingqi Shi

Keywords

Artificial Intelligence Health Personnel Humans Language Large Language Models

External Resources

View on PubMed Access via DOI PubMed (40279517)

Comparing Diagnostic Accuracy of Clinical Professionals and Large Language Models: Systematic Review and Meta-Analysis.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals