A systematic review of large language model (LLM) evaluations in clinical medicine.
Journal:
BMC medical informatics and decision making
Published Date:
Mar 7, 2025
Abstract
BACKGROUND: Large Language Models (LLMs), advanced AI tools based on transformer architectures, demonstrate significant potential in clinical medicine by enhancing decision support, diagnostics, and medical education. However, their integration into clinical workflows requires rigorous evaluation to ensure reliability, safety, and ethical alignment.