EHR Text Categorization for Enhanced Patient-Based Document Navigation.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Patients with multiple disorders usually have long diagnosis lists, constitute by ICD-10 codes together with individual free-text descriptions. These text snippets are produced by overwriting standardized ICD-Code topics by the physicians at the point of care. They provide highly compact expert descriptions within a 50-character long text field frequently not assigned to a specific ICD-10 code. The high redundancy of these lists would benefit from content-based categorization within different hospital-based application scenarios. This work demonstrates how to accurately group diagnosis lists via a combination of natural language processing and hierarchical clustering with an overall F-measure value of 0.87. In addition, it compresses the initial diagnosis list up to 89%. The manuscript discusses pitfall and challenges as well as the potential of a large-scale approach for tackling this problem.

Authors

  • Markus Kreuzthaler
    Institute of Medical Informatics, Statistics, and Documentation, Medical University of Graz, Austria.
  • Bastian Pfeifer
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.
  • José Antonio Vera Ramos
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.
  • Diether Kramer
    Steiermärkische Krankenanstaltengesellschaft m.b.H. (KAGes), Graz, Austria.
  • Victor Grogger
    KAGes Steiermärkische Krankenanstaltengesellschaft m.b.H., Graz, Austria.
  • Sylvia Bredenfeldt
    KAGes Steiermärkische Krankenanstaltengesellschaft m.b.H., Graz, Austria.
  • Markus Pedevilla
    KAGes Steiermärkische Krankenanstaltengesellschaft m.b.H., Graz, Austria.
  • Peter Krisper
    Division of Nephrology and Dialysis, Department of Internal Medicine, Medical University of Graz, Austria.
  • Stefan Schulz
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.