Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials.

Journal: Journal of cardiovascular translational research

Published Date: Jun 5, 2017

Abstract

Precision medicine requires clinical trials that are able to efficiently enroll subtypes of patients in whom targeted therapies can be tested. To reduce the large amount of time spent screening, identifying, and recruiting patients with specific subtypes of heterogeneous clinical syndromes (such as heart failure with preserved ejection fraction [HFpEF]), we need prescreening systems that are able to automate data extraction and decision-making tasks. However, a major obstacle is the vast amount of unstructured free-form text in medical records. Here we describe an information extraction-based approach that automatically converts unstructured text into structured data, which is cross-referenced against eligibility criteria using a rule-based system to determine which patients qualify for a major HFpEF clinical trial (PARAGON). We show that we can achieve a sensitivity and positive predictive value of 0.95 and 0.86, respectively. Our open-source algorithm could be used to efficiently identify and subphenotype patients with HFpEF and other disorders.

Authors

Siddhartha R Jonnalagadda

Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, 750 N. Lake Shore Drive, 11th Floor, Chicago, IL 60611, USA.
Abhishek K Adupa

Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
Ravi P Garg

Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
Jessica Corona-Cox

Division of Cardiology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, 60611, USA.
Sanjiv J Shah

Division of Cardiology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.

Keywords

Algorithms Clinical Trials as Topic Data Mining Echocardiography Electronic Health Records Eligibility Determination Heart Failure Humans Natural Language Processing Patient Selection Phenotype Predictive Value of Tests Reproducibility of Results Stroke Volume

External Resources

View on PubMed Access via DOI PubMed (28585184)

Text Mining of the Electronic Health Record: An Information Extraction Approach for Automated Identification and Subphenotyping of HFpEF Patients for Clinical Trials.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals