Clinical Data Extraction and Normalization of Cyrillic Electronic Health Records Via Deep-Learning Natural Language Processing.

Journal: JCO clinical cancer informatics

Published Date: Sep 1, 2019

Abstract

PURPOSE: A substantial portion of medical data is unstructured. Extracting data from unstructured text presents a barrier to advancing clinical research and improving patient care. In addition, ongoing studies have been focused predominately on the English language, whereas inflected languages with non-Latin alphabets (such as Slavic languages with a Cyrillic alphabet) present numerous linguistic challenges. We developed deep-learning-based natural language processing algorithms for automatically extracting biomarker status of patients with breast cancer from three oncology centers in Bulgaria.

Authors

Boyang Zhao

Sqilline Health, Boston, MA.

Keywords

Algorithms Area Under Curve Biomarkers, Tumor Breast Neoplasms Bulgaria Deep Learning Electronic Health Records Female Humans Medical Informatics Natural Language Processing Neural Networks, Computer Reproducibility of Results

External Resources

View on PubMed Access via DOI PubMed (31577448)

Clinical Data Extraction and Normalization of Cyrillic Electronic Health Records Via Deep-Learning Natural Language Processing.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals