Extraction of left ventricular ejection fraction information from various types of clinical reports.

Journal: Journal of biomedical informatics
Published Date:

Abstract

Efforts to improve the treatment of congestive heart failure, a common and serious medical condition, include the use of quality measures to assess guideline-concordant care. The goal of this study is to identify left ventricular ejection fraction (LVEF) information from various types of clinical notes, and to then use this information for heart failure quality measurement. We analyzed the annotation differences between a new corpus of clinical notes from the Echocardiography, Radiology, and Text Integrated Utility package and other corpora annotated for natural language processing (NLP) research in the Department of Veterans Affairs. These reports contain varying degrees of structure. To examine whether existing LVEF extraction modules we developed in prior research improve the accuracy of LVEF information extraction from the new corpus, we created two sequence-tagging NLP modules trained with a new data set, with or without predictions from the existing LVEF extraction modules. We also conducted a set of experiments to examine the impact of training data size on information extraction accuracy. We found that less training data is needed when reports are highly structured, and that combining predictions from existing LVEF extraction modules improves information extraction when reports have less structured formats and a rich set of vocabulary.

Authors

  • Youngjun Kim
  • Jennifer H Garvin
    IDEAS Center SLC VA Healthcare System, 500 Foothill Drive, Salt Lake City, UT, 84148, USA. jennifer.garvin@va.gov.
  • Mary K Goldstein
    VA Palo Alto Health Care System, Palo Alto, CA, and Stanford University, Stanford, CA, USA.
  • Tammy S Hwang
    VA Palo Alto Health Care System, Palo Alto, CA, USA.
  • Andrew Redd
    Andrew Redd, PhD, is an Assistant Professor at the University of Utah and a statistician in the IDEAS Center at the VA Salt Lake City Health Care System in Salt Lake City, UT.
  • Dan Bolton
    Dan Bolton, MS, is a statistician in the IDEAS Center at the VA Salt Lake City Health Care System in Salt Lake City, UT.
  • Paul A Heidenreich
    Veterans Administration Palo Alto Health Care System, Palo Alto, California.
  • Stéphane M Meystre
    Department of Biomedical Informatics, University of Utah, Salt Lake City, USA.