Estimating a Bias in ICD Encodings for Billing Purposes.

Journal: Studies in health technology and informatics
Published Date:

Abstract

ICD encoded diagnoses are a popular criterion for eligibility algorithms for study cohort recruitment. However, "official" ICD encoded diagnoses used for billing purposes are afflicted with a bias originating from legal issues. This work presents an approach to estimate the degree of the encoding bias for the complete ICD catalogue at a German university hospital. The free text diagnoses sections of discharge letters are automatically classified using a supervised machine learning algorithm. The automatic classifications are compared with the official, manually classified codes. For selected ICD codes the approach works sufficiently well.

Authors

  • Georg Fette
    Würzburg University, Computer Science 6.
  • Markus Krug
    Würzburg University, Computer Science 6.
  • Mathias Kaspar
    University Hospital of Würzburg, Comprehensive Heart Failure Center.
  • Leon Liman
    Würzburg University, Computer Science 6.
  • Georg Dietrich
    Würzburg University, Computer Science 6.
  • Maximilian Ertl
    University Hospital of Würzburg, Comprehensive Heart Failure Center.
  • Jonathan Krebs
    Würzburg University, Computer Science 6.
  • Stefan Störk
    Department Clinical Research and Epidemiology, Comprehensive Heart Failure Center, University Hospital Würzburg, Am Schwarzenberg 15, D-97078 Würzburg, Germany.
  • Frank Puppe
    Würzburg University, Computer Science 6.