Complexities, variations, and errors of numbering within clinical notes: the potential impact on information extraction and cohort-identification.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: Numbers and numerical concepts appear frequently in free-text clinical notes from electronic health records. Knowledge of the frequent lexical variations of these numerical concepts, and their accurate identification, is important for many information extraction tasks. This paper describes an analysis of the variation in how numbers and numerical concepts are represented in clinical notes.

Authors

  • David A Hanauer
    Department of Pediatrics, University of Michigan Medical School, Ann Arbor, MI, USA; School of Information, University of Michigan, Ann Arbor, MI, USA. Electronic address: hanauer@med.umich.edu.
  • Qiaozhu Mei
    University of Michigan, Ann Arbor, MI.
  • V G Vinod Vydiswaran
    Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI, USA.
  • Karandeep Singh
    Department of Internal Medicine and School of Information, University of Michigan, Ann Arbor, Michigan.
  • Zach Landis-Lewis
    Department of Learning Health Sciences, University of Michigan, Ann Arbor, MI, 48109, USA.
  • Chunhua Weng
    Department of Biomedical Informatics, Columbia University.