Using Natural Language Processing to improve EHR Structured Data-based Surgical Site Infection Surveillance.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium

Published Date: Mar 4, 2020

Abstract

Surgical Site Infection surveillance in healthcare systems is labor intensive and plagued by underreporting as current methodology relies heavily on manual chart review. The rapid adoption of electronic health records (EHRs) has the potential to allow the secondary use of EHR data for quality surveillance programs. This study aims to investigate the effectiveness of integrating natural language processing (NLP) outputs with structured EHR data to build machine learning models for SSI identification using real-world clinical data. We examined a set of models using structured data with and without NLP document-level, mention-level, and keyword features. The top-performing model was based on a Random Forest classifier enhanced with NLP document-level features achieving a 0.58 sensitivity, 0.97 specificity, 0.54 PPV, 0.98 NPV, and 0.52 F score. We further interrogated the feature contributions, analyzed the errors, and discussed future directions.

Authors

Jianlin Shi

University of Utah, Salt Lake City, UT, USA.
Siru Liu

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Liese C C Pruitt

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Carolyn L Luppens

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Jeffrey P Ferraro

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Adi V Gundlapalli

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Wendy W Chapman

School of Medicine, University of Utah, Salt Lake City, Utah, US.
Brian T Bucher

School of Medicine, University of Utah, Salt Lake City, Utah, US.

Keywords

Algorithms Decision Trees Electronic Health Records Humans Information Storage and Retrieval Logistic Models Machine Learning Natural Language Processing Sensitivity and Specificity Support Vector Machine Surgical Wound Infection

External Resources

View on PubMed PubMed (32308875)

Using Natural Language Processing to improve EHR Structured Data-based Surgical Site Infection Surveillance.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals