Aligned-Layer Text Search in Clinical Notes.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Search techniques in clinical text need to make fine-grained semantic distinctions, since medical terms may be negated, about someone other than the patient, or at some time other than the present. While natural language processing (NLP) approaches address these fine-grained distinctions, a task like patient cohort identification from electronic health records (EHRs) simultaneously requires a much more coarse-grained combination of evidence from the text and structured data of each patient's health records. We thus introduce aligned-layer language models, a novel approach to information retrieval (IR) that incorporates the output of other NLP systems. We show that this framework is able to represent standard IR queries, formulate previously impossible multi-layered queries, and customize the desired degree of linguistic granularity.

Authors

  • Stephen Wu
    School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Andrew Wen
    Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA.
  • Yanshan Wang
    Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.
  • Sijia Liu
    These authors contributed equally to this study and Dr. Li is now working at IBM; Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA; Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA.
  • Hongfang Liu
    Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.