An Automated Feature Engineering for Digital Rectal Examination Documentation using Natural Language Processing.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
PMID:

Abstract

Digital rectal examination (DRE) is considered a quality metric for prostate cancer care. However, much of the DRE related rich information is documented as free-text in clinical narratives. Therefore, we aimed to develop a natural language processing (NLP) pipeline for automatic documentation of DRE in clinical notes using a domain-specific dictionary created by clinical experts and an extended version of the same dictionary learned by clinical notes using distributional semantics algorithms. The proposed pipeline was compared to a baseline NLP algorithm and the results of the proposed pipeline were found superior in terms of precision (0.95) and recall (0.90) for documentation of DRE. We believe the rule-based NLP pipeline enriched with terms learned from the whole corpus can provide accurate and efficient identification of this quality metric.

Authors

  • Selen Bozkurt
    Department of Biostatistics and Medical Informatics, Akdeniz University Faculty of Medinice, 48000 Antalya, Turkey.
  • Jung In Park
    Department of Medicine, Center for Biomedical Informatics Research, Stanford University, Stanford, CA.
  • Kathleen Mary Kan
    Department of Urology, Stanford University School of Medicine, Stanford, CA.
  • Michelle Ferrari
    Stanford University, School of Medicine, Stanford, CA.
  • Daniel L Rubin
    Department of Biomedical Data Science, Stanford University School of Medicine Medical School Office Building, Stanford CA 94305-5479.
  • James D Brooks
    Department of Urology, Stanford School of Medicine, CA.
  • Tina Hernandez-Boussard
    Stanford Center for Biomedical Informatics Research, Stanford, California 94305, USA.