Extracting a stroke phenotype risk factor from Veteran Health Administration clinical reports: an information content analysis.

Journal: Journal of biomedical semantics
PMID:

Abstract

BACKGROUND: In the United States, 795,000 people suffer strokes each year; 10-15 % of these strokes can be attributed to stenosis caused by plaque in the carotid artery, a major stroke phenotype risk factor. Studies comparing treatments for the management of asymptomatic carotid stenosis are challenging for at least two reasons: 1) administrative billing codes (i.e., Current Procedural Terminology (CPT) codes) that identify carotid images do not denote which neurovascular arteries are affected and 2) the majority of the image reports are negative for carotid stenosis. Studies that rely on manual chart abstraction can be labor-intensive, expensive, and time-consuming. Natural Language Processing (NLP) can expedite the process of manual chart abstraction by automatically filtering reports with no/insignificant carotid stenosis findings and flagging reports with significant carotid stenosis findings; thus, potentially reducing effort, costs, and time.

Authors

  • Danielle L Mowery
    Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT.
  • Brian E Chapman
    University of Utah, Department of Radiology, 729 Arapeen Drive, Salt Lake City, UT 84108, United States. Electronic address: brian.chapman@utah.edu.
  • Mike Conway
    Department of Biomedical Informatics, School of Medicine University of Utah 421 Wakara Way Ste 140, Salt Lake City, UT 84108-3514, USA.
  • Brett R South
    University of Utah, Salt Lake City, Utah, USA.
  • Erin Madden
    San Francisco Veteran Affair Health Care System, San Francisco, CA USA.
  • Salomeh Keyhani
    San Francisco Veteran Affair Health Care System, San Francisco, CA USA.
  • Wendy W Chapman
    School of Medicine, University of Utah, Salt Lake City, Utah, US.