Validity of Natural Language Processing for Ascertainment of and Test Results in SEER Cases of Stage IV Non-Small-Cell Lung Cancer.
Journal:
JCO clinical cancer informatics
PMID:
31058542
Abstract
PURPOSE: SEER registries do not report results of epidermal growth factor receptor () and anaplastic lymphoma kinase () mutation tests. To facilitate population-based research in molecularly defined subgroups of non-small-cell lung cancer (NSCLC), we assessed the validity of natural language processing (NLP) for the ascertainment of EGFR and ALK testing from electronic pathology (e-path) reports of NSCLC cases included in two SEER registries: the Cancer Surveillance System (CSS) and the Kentucky Cancer Registry (KCR).
Authors
Keywords
Adult
Aged
Algorithms
Anaplastic Lymphoma Kinase
Carcinoma, Non-Small-Cell Lung
DNA Mutational Analysis
ErbB Receptors
Female
Genetic Testing
Humans
Kentucky
Lung Neoplasms
Machine Learning
Male
Middle Aged
Mutation
Natural Language Processing
Population Surveillance
Registries
Reproducibility of Results
SEER Program