Comprehensive testing of large language models for extraction of structured data in pathology.
Journal:
Communications medicine
Published Date:
Mar 31, 2025
Abstract
BACKGROUND: Pathology departments generate large volumes of unstructured data as free-text diagnostic reports. Converting these reports into structured formats for analytics or artificial intelligence projects requires substantial manual effort by specialized personnel. While recent studies show promise in using advanced language models for structuring pathology data, they primarily rely on proprietary models, raising cost and privacy concerns. Additionally, important aspects such as prompt engineering and model quantization for deployment on consumer-grade hardware remain unaddressed.
Authors
Keywords
No keywords available for this article.