Augmenting Fact and Date of Death in Electronic Health Records using Internet Media Sources: A Validation Study from Two Large Healthcare Systems

Journal: medRxiv
Published Date:

Abstract

Our objective was to evaluate the validity of death ascertainment from publicly available internet media (IM) sources by benchmarking against state and federal vital statistics data for patients in two large US healthcare systems. We extracted names and dates of birth and death from publicly available data, including obituaries and memorial websites, using previously developed natural language processing models. These data were probabilistically matched to electronic health records (EHRs) from Mass General Brigham (MGB) and Vanderbilt University Medical Center (VUMC) on first name, last name, and date of birth. Using state vital statistics databases from MA, CT, and VT as the reference standard for MGB patients and the National Death Index (NDI) for VUMC patients, we reported positive predictive values (PPVs), considering cases where the date of death from IM sources was within 7 days of the reference standard to be true positives. We also reported the sensitivity of deaths ascertained from IM sources. When probabilistically matching 8.1 million deaths extracted from public data to 78,848 deaths observed in the reference standards across the two sites, 30,607 (38.8%) matched exactly. The PPV for exact matches was 98.2% for MGB and 98.9% for VUMC, whereas the PPV for non-exact matches was <6%. Considering only the exact matches, IM sources improved the sensitivity of death capture by 24% in MGB and 18% in VUMC, compared with using EHRs alone for death ascertainment. Using public information to augment mortality data meaningfully increased capture of deaths over reliance on EHR records alone.
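The two validation metrics described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code: the function names, the input layout (pairs of IM and reference death dates), and the choice to treat the 7-day window as inclusive are assumptions made for illustration only.

```python
from datetime import date, timedelta
from typing import Optional, Sequence, Tuple

# Assumption: "within 7 days" is treated as inclusive (a difference of exactly 7 days counts).
TOLERANCE = timedelta(days=7)

# Hypothetical record layout: (death date from IM source, death date in reference or None).
Match = Tuple[date, Optional[date]]


def is_true_positive(im_death: date, ref_death: Optional[date]) -> bool:
    """An IM-identified death is a true positive if the reference standard
    records a death within the tolerance window of the IM date."""
    if ref_death is None:
        return False  # no death found in the reference standard
    return abs(im_death - ref_death) <= TOLERANCE


def ppv(matches: Sequence[Match]) -> float:
    """Positive predictive value: true positives / all IM-identified deaths."""
    if not matches:
        raise ValueError("no matched records")
    true_positives = sum(is_true_positive(im, ref) for im, ref in matches)
    return true_positives / len(matches)


def sensitivity(captured: int, reference_total: int) -> float:
    """Share of reference-standard deaths that a source captured."""
    if reference_total <= 0:
        raise ValueError("reference_total must be positive")
    return captured / reference_total


# Made-up example records, not study data.
example_matches = [
    (date(2020, 1, 10), date(2020, 1, 12)),  # 2 days apart: true positive
    (date(2020, 3, 1), date(2020, 3, 20)),   # 19 days apart: false positive
    (date(2020, 5, 5), None),                # not in reference: false positive
    (date(2020, 6, 1), date(2020, 6, 8)),    # exactly 7 days: true positive
]
example_ppv = ppv(example_matches)  # 2 of 4 records qualify
```

Under these assumptions, the sensitivity gain reported in the abstract would correspond to comparing `sensitivity` computed from EHR-recorded deaths alone with `sensitivity` computed from EHR-recorded deaths plus exactly matched IM deaths, against the same reference total.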

Authors

  • Michele LeNoue-Newton; Mohammed al-Garadi; Kerry Ngan; Haritha Pillai; Ruth M. Reeves; Daniel Park; Dax M. Westerman; José J. Hernández-Muñoz; Xi Wang; Aida Kuzucan; Shirley V. Wang; Kueiyu Joshua Lin; Candace Fuller; Melissa McPheeters; Michael E. Matheny; Rishi J. Desai

Categories