Application of optical character recognition with natural language processing for large-scale quality metric data extraction in colonoscopy reports.

Journal: Gastrointestinal endoscopy
Published Date:

Abstract

BACKGROUND AND AIMS: Colonoscopy is commonly performed for colorectal cancer screening in the United States. Reports are often generated in a non-standardized format and are not always integrated into electronic health records. Thus, this information is not readily available for streamlining quality management, participating in endoscopy registries, or reporting of patient- and center-specific risk factors predictive of outcomes. We aim to demonstrate the use of a new hybrid approach using natural language processing of charts that have been elucidated with optical character recognition processing (OCR/NLP hybrid) to obtain relevant clinical information from scanned colonoscopy and pathology reports, a technology co-developed by Cleveland Clinic and eHealth Technologies (West Henrietta, NY, USA).

Authors

  • Sobia Nasir Laique
    Division of Gastroenterology and Hepatology, Mayo Clinic, Phoenix, Arizona, USA.
  • Umar Hayat
    Division of Gastroenterology, University of Minnesota, Minneapolis, Minnesota, USA.
  • Shashank Sarvepalli
    Department of Hospital Medicine, Cleveland Clinic, Cleveland, Ohio, USA; Department of Bioinformatics, Vanderbilt University, Nashville, Tennessee, USA.
  • Byron Vaughn
    Division of Gastroenterology, University of Minnesota, Minneapolis, Minnesota, USA.
  • Mounir Ibrahim
    Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA.
  • John McMichael
    Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA.
  • Kanza Noor Qaiser
    Department of Hospital Medicine, Cleveland Clinic, Cleveland, Ohio, USA.
  • Carol Burke
    Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA.
  • Amit Bhatt
    Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA.
  • Colin Rhodes
    eHealth Technology, West Henrietta, New York, New York, USA.
  • Maged K Rizk
    Digestive Disease Institute, Cleveland Clinic, Cleveland, Ohio, USA.