AI Accelerated Human-in-the-loop Structuring of Radiology Reports.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Rule-based Natural Language Processing (NLP) pipelines depend on robust domain knowledge. Given the long tail of important terminology in radiology reports, it is not uncommon for standard approaches to miss items critical for understanding the image. AI techniques can accelerate the concept expansion and phrasal grouping tasks to efficiently create a domain specific lexicon ontology for structuring reports. Using Chest X-ray (CXR) reports as an example, we demonstrate that with robust vocabulary, even a simple NLP pipeline can extract 83 directly mentioned abnormalities (Ave. recall=93.83%, precision=94.87%) and 47 abnormality/normality descriptions of key anatomies. The richer vocabulary enables identification of additional label mentions in 10 out of 13 labels (compared to baseline methods). Furthermore, it captures expert insight into critical differences between observed and inferred descriptions, and image quality issues in reports. Finally, we show how the CXR ontology can be used to anatomically structure labeled output.

Authors

  • Joy T Wu
    IBM Almaden Research Center, San Jose, CA.
  • Ali Syed
    IBM Almaden Research Center, San Jose, CA.
  • Hassan Ahmad
    IBM Almaden Research Center, San Jose, CA.
  • Anup Pillai
    IBM Almaden Research Center, San Jose, CA.
  • Yaniv Gur
    IBM Almaden Research Center, San Jose, CA.
  • Ashutosh Jadhav
    IBM Almaden Research Center, San Jose, CA.
  • Daniel Gruhl
    IBM Almaden Research Center, San Jose, CA.
  • Linda Kato
    IBM Almaden Research Center, San Jose, CA.
  • Mehdi Moradi
    IBM Almaden Research Center, San Jose, CA.
  • Tanveer Syeda-Mahmood
    IBM Almaden Research Center, San Jose, CA.