Towards a Semantic Data Harmonization Federated Infrastructure.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Data integration is a growing need in medical informatics projects such as the EU Precise4Q project, in which multidisciplinary data that is semantically and syntactically heterogeneous must be integrated across several institutions. Moreover, data sharing agreements often permit only virtual data integration, because data cannot leave the source repository. We propose a data harmonization infrastructure in which data is virtually integrated by sharing a semantically rich common data representation that allows homogeneous querying. This common data model integrates content from well-known biomedical ontologies such as SNOMED CT by means of the BTL2 upper-level ontology, and is imported into a graph database. We successfully integrated three datasets and ran test queries that show the feasibility of the approach.
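
The idea of virtual integration through a shared representation can be illustrated with a minimal sketch. It is not the infrastructure described above, which relies on an ontology-based model in a graph database. It only assumes that each source keeps its records locally, exposes a mapping from its own schema to shared concepts, and returns query answers rather than data. All names here (`Source`, `federated_count`, the `common:` concept labels, the site schemas) are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

# Hypothetical shared concept labels standing in for ontology-based terms.
# They are placeholders, not real SNOMED CT or BTL2 identifiers.
STROKE = "common:Stroke"
HYPERTENSION = "common:Hypertension"


@dataclass
class Source:
    """A data source whose records stay local; only query answers leave it."""
    name: str
    records: List[dict]               # rows in the source's own schema
    to_common: Callable[[dict], set]  # maps one local row to shared concepts

    def count_matching(self, concept: str) -> int:
        # The query is evaluated locally against the harmonized view.
        return sum(1 for row in self.records if concept in self.to_common(row))


def federated_count(sources: Iterable[Source], concept: str) -> Dict[str, int]:
    """Send the same concept query to every source and collect the counts."""
    return {s.name: s.count_matching(concept) for s in sources}


# Site A stores one free-text diagnosis label per row (assumed schema).
SITE_A_MAP = {"stroke": STROKE, "high blood pressure": HYPERTENSION}
site_a = Source(
    name="site_a",
    records=[
        {"diagnosis": "stroke"},
        {"diagnosis": "high blood pressure"},
        {"diagnosis": "stroke"},
    ],
    to_common=lambda row: {SITE_A_MAP[row["diagnosis"]]},
)


def site_b_to_common(row: dict) -> set:
    # Site B stores one 0/1 flag per condition (assumed schema).
    concepts = set()
    if row["has_stroke"]:
        concepts.add(STROKE)
    if row["has_htn"]:
        concepts.add(HYPERTENSION)
    return concepts


site_b = Source(
    name="site_b",
    records=[
        {"has_stroke": 1, "has_htn": 1},
        {"has_stroke": 0, "has_htn": 1},
    ],
    to_common=site_b_to_common,
)

if __name__ == "__main__":
    print(federated_count([site_a, site_b], STROKE))
```

The same query is expressed once, in terms of the shared concepts, and each source answers it over its own syntactically different records. In this sketch only aggregate counts cross the source boundary, which is one simple way to respect the constraint that data cannot leave the source repository.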

Authors

  • Catalina Martínez-Costa
    Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Austria.
  • Francisco Abad-Navarro
    Departamento de Informática y Sistemas, Universidad de Murcia, Campus de Espinardo, 30100, Murcia, Spain.