Task definition, annotated dataset, and supervised natural language processing models for symptom extraction from unstructured clinical notes.
Journal:
Journal of biomedical informatics
Published Date:
Dec 12, 2019
Abstract
INTRODUCTION: Machine learning (ML) and natural language processing have great potential to improve information extraction (IE) within electronic medical records (EMRs) for a wide variety of clinical search and summarization tools. Despite ML advancements, clinical adoption of real time IE tools for patient care remains low. Clinically motivated IE task definitions, publicly available annotated clinical datasets, and inclusion of subtasks such as coreference resolution and named entity normalization are critical for the development of useful clinical tools.