Comparative evaluation of six large language models in transfusion medicine: Addressing language and domain-specific challenges.
Journal:
Vox sanguinis
Published Date:
May 23, 2025
Abstract
BACKGROUND AND OBJECTIVES: Large language models (LLMs) such as GPT-4 are increasingly utilized in clinical and educational settings; however, their validity in subspecialized domains like transfusion medicine remains insufficiently characterized. This study assessed the performance of six LLMs on transfusion-related questions from Korean national licensing examinations for medical doctors (MDs) and medical technologists (MTs).
Authors
Keywords
No keywords available for this article.