An ontology-based rare disease common data model harmonising international registries, FHIR, and Phenopackets.

Journal: Scientific data
PMID:

Abstract

Although rare diseases (RDs) affect over 260 million individuals worldwide, low data quality and scarcity challenge effective care and research. This work aims to harmonise the Common Data Set by European Rare Disease Registry Infrastructure, Health Level 7 Fast Healthcare Interoperability Base Resources, and the Global Alliance for Genomics and Health Phenopacket Schema into a novel rare disease common data model (RD-CDM), laying the foundation for developing international RD-CDMs aligned with these data standards. We developed a modular-based GitHub repository and documentation to account for flexibility, extensions and further development. Recommendations on the model's cardinalities are given, inviting further refinement and international collaboration. An ontology-based approach was selected to find a common denominator between the semantic and syntactic data standards. Our RD-CDM version 2.0.0 comprises 78 data elements, extending the ERDRI-CDS by 62 elements with previous versions implemented in four German university hospitals capturing real world data for development and evaluation. We identified three categories for evaluation: Medical Data Granularity, Clinical Reasoning and Medical Relevance, and Interoperability and Harmonisation.

Authors

  • Adam S L Graefe
    Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany. adam.graefe@charite.de.
  • Miriam R Hübner
    Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
  • Filip Rehburg
    Berlin Institute of Health at Charité, Germany.
  • Steffen Sander
    Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
  • Sophie A I Klopfenstein
    Institute of Medical Informatics, Charité - Universitätsmedizin Berlin, Germany.
  • Samer Alkarkoukly
    CECAD, Faculty of Medicine and University Hospital Cologne, University of Cologne, Joseph-Stelzmann-Straße 26, 50931 Cologne.
  • Ana Grönke
    Medical Data Integration Center (MeDIC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany.
  • Annic Weyersberg
    Department of Paediatrics, University Hospital Cologne, Cologne, Germany.
  • Daniel Danis
    Monarch Initiative, monarchinitiative.org.
  • Jana Zschüntzsch
    Department of Neurology, University Medical Center Goettingen, Goettingen, Germany.
  • Elisabeth F Nyoungui
    Department of Medical Informatics, University Medical Center Goettingen, Goettingen, Germany.
  • Susanna Wiegand
    Department of Pediatric Endocrinology and Diabetology, Charité University Hospital, Berlin, Germany.
  • Peter Kühnen
    Department of Pediatric Endocrinology and Diabetology, Charité University Hospital, Berlin, Germany.
  • Peter N Robinson
    The Jackson Laboratory for Genomic Medicine Farmington CT 06032 USA.
  • Oya Beyan
    Institute for Biomedical Informatics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
  • Sylvia Thun
    Charité Universitätsmedizin, Berlin Institute of Health, Germany.