Extracting Sexual Trauma Mentions from Electronic Medical Notes Using Natural Language Processing.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Patient history of sexual trauma is of clinical relevance to healthcare providers as survivors face adverse health-related outcomes. This paper describes a method for identifying mentions of sexual trauma within the free text of electronic medical notes. A natural language processing pipeline for information extraction was developed and scaled to handle a large corpus of electronic medical notes used for this study from US Veterans Health Administration medical facilities. The tool was used to identify sexual trauma mentions and create snippets around every asserted mention based on a domain-specific lexicon developed for this purpose. All snippets were evaluated by trained human reviewers. An overall positive predictive value (PPV) of 0.90 for identifying sexual trauma mentions from the free text and a PPV of 0.71 at the patient level are reported. The metrics are superior for records from female patients.

Authors

  • Guy Divita
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Emily Brignone
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Marjorie E Carter
    IDEAS Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA; Departments of Internal Medicine and Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah, USA.
  • Ying Suo
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Rebecca K Blais
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Matthew H Samore
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Jamison D Fargo
    VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.
  • Adi V Gundlapalli
    School of Medicine, University of Utah, Salt Lake City, Utah, US.