From Silos to Synthesis: A comprehensive review of domain adaptation strategies for multi-source data integration in healthcare.

Journal: Computers in biology and medicine
Published Date:

Abstract

BACKGROUND: The integration of data from diverse sources is not only crucial for addressing data scarcity in health informatics but also enables the use of complementary information from multiple datasets. However, the isolated nature of data collected from disparate sources (referred to as 'Silos') presents significant challenges in multi-source data integration due to inherent heterogeneity and differences in data structures, formats, and standards. Domain adaptation emerges as a key framework to transition from 'Silos' to 'Synthesis' by measuring and mitigating such discrepancies, enabling uniform representation and harmonization of multi-source data.

Authors

  • Shelia Rahman Tuly
    Department of Computer Science, Wayne State University, 5057 Woodward Ave, Detroit, 48201, MI, USA. Electronic address: shelia.tuly@wayne.edu.
  • Sima Ranjbari
    Department of Computer Science, Wayne State University, 5057 Woodward Ave, Detroit, 48201, MI, USA. Electronic address: sima.ranjbari@wayne.edu.
  • Ekrem Alper Murat
    Department of Industrial and Systems Engineering, Wayne State University, 4th Street, Detroit, 48201, MI, USA. Electronic address: amurat@wayne.edu.
  • Suzan Arslanturk
    Department of Computer Science, Wayne State University, Detroit, 48201, USA. suzan.arslanturk@wayne.edu.