Preprocessing of natural language process variables using a data-driven method improves the association with suicide risk in a large veterans affairs population.
Journal:
Computers in biology and medicine
PMID:
40048900
Abstract
OBJECTIVE: Suicide risk assessment has historically relied heavily on clinical evaluations and patient self-reports. Natural language processing (NLP) of electronic health records (EHRs) provides an alternative approach for extracting risk predictors from clinical notes. Modeling NLP variables, however, is challenging because of zero inflation and skewed distributions. Therefore, we evaluated whether an adaptive-mixture-categorization (AMC) method could optimize the suicide risk predictive capacity of NLP data extracted from Veterans Affairs (VA) EHR notes.
Authors
Keywords
Adolescent
Adult
Aged
Aged, 80 and over
Case-Control Studies
Electronic Health Records
Female
Humans
Male
Middle Aged
Natural Language Processing
Predictive Learning Models
Psychological Well-Being
Risk Assessment
Suicide Prevention
Suicide, Completed
United States
United States Department of Defense
United States Department of Veterans Affairs
Veterans
Young Adult