DeepSeek-R1 and GPT-4 are comparable in a complex diagnostic challenge: a historical control study.

Journal: International journal of surgery (London, England)

Published Date: Apr 3, 2025

Abstract

BACKGROUND: Large language models (LLMs) have demonstrated potential in medical diagnostics, but their accuracy in complex cases remains a subject of investigation. DeepSeek-R1, an open-source model with advanced reasoning capabilities, has gained global attention. This study evaluates the diagnostic performance of DeepSeek-R1 compared to GPT-4 in complex clinical cases.

Authors

Lining Chan

Department of Plastic Surgery, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, People's Republic of China.
Xinjie Xu
Kaiyang Lv

Keywords

Diagnosis, Differential; Generative Artificial Intelligence; Humans

External Resources

PubMed ID: 40505040
