High Accuracy Open-Source Clinical Data De-Identification: The CliniDeID Solution.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Clinical data de-identification offers patient data privacy protection and eases reuse of clinical data. As an open-source solution to de-identify unstructured clinical text with high accuracy, CliniDeID applies an ensemble method combining deep and shallow machine learning with rule-based algorithms. It reached high recall and precision when recently evaluated with a selection of clinical text corpora.

Authors

  • Stephane Meystre
    Stephane Meystre, MD, PhD, is an Assistant Professor at the University of Utah and a Research Investigator in the IDEAS Center at the VA Salt Lake City Health Care System in Salt Lake City, UT.
  • Paul Heider
    Medical University of South Carolina, Charleston, South Carolina, USA.