Clinical Information Extraction with Large Language Models: A Case Study on Organ Procurement.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Recent work has demonstrated that large language models (LLMs) are powerful tools for clinical information extraction from unstructured text. However, existing approaches have largely ignored the extraction of numeric information such as laboratory tests and vital signs. In this article, we present a case study on organ procurement that evaluates the ability of LLMs to extract numeric data from clinical text. We first describe our LLM-based approach, introducing a prompting strategy for numeric extraction and novel heuristics to combat hallucination. We validate our approach on a hand-annotated set of 298 notes, demonstrating that it has high accuracy, precision and recall. We then highlight the value of our approach for downstream data analysis using a corpus of 43,719 notes on 14,342 potential organ donors. This case study is a key component of an ongoing collaboration that aims to make data on organ procurement publicly available for informatics research.

Authors

  • Hammaad Adam
    Massachusetts Institute of Technology, Cambridge, MA, USA.
  • Junjing Lin
    Takeda Pharmaceuticals, Cambridge, MA, USA.
  • Jianchang Lin
    Statistical and Quantitative Science, Data Sciences Institute, Takeda Pharmaceutical Co. Limited, 300 Mass Ave, West Chester, PA, 19380, USA.
  • Hillary Keenan
    Takeda Pharmaceuticals, Cambridge, MA, USA.
  • Ashia Wilson
    Massachusetts Institute of Technology, Cambridge, MA, USA.
  • Marzyeh Ghassemi
    Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, United States.