Data Element Mapping in the Data Privacy Era.

Journal: Studies in health technology and informatics
Published Date:

Abstract

Secondary use of health data is made difficult in part because of large semantic heterogeneity. Many efforts are being made to align local terminologies with international standards. With increasing concerns about data privacy, we focused here on the use of machine learning methods to align biological data elements using aggregated features that could be shared as open data. A 3-step methodology (features engineering, blocking strategy and supervised learning) was proposed. The first results, although modest, are encouraging for the future development of this approach.

Authors

  • Romain Griffier
    Department of Public Health, Faculty of medicine, University of Bordeaux, Bordeaux, France.
  • Sébastien Cossin
    Univ. Bordeaux, Inserm, Bordeaux Population Health Research Center, UMR 1219, Bordeaux, France.
  • François Konschelle
    Bordeaux University Hospital, Public health, 33000 Bordeaux, France.
  • Fleur Mougin
    Université Bordeaux, ISPED, Centre INSERM U897, ERIAS, France.
  • Vianney Jouhet
    ERIAS, INSERM U897, ISPED, Université Bordeaux, Bordeaux, France.