scMODAL: a general deep learning framework for comprehensive single-cell multi-omics data alignment with feature links.

Journal: Nature Communications
Published Date:

Abstract

Recent advancements in single-cell technologies have enabled comprehensive characterization of cellular states through transcriptomic, epigenomic, and proteomic profiling at single-cell resolution. These technologies have significantly deepened our understanding of cell functions and disease mechanisms from various omics perspectives. As these technologies evolve rapidly and data resources expand, there is a growing need for computational methods that can integrate information from different modalities to facilitate joint analysis of single-cell multi-omics data. However, integrating single-cell omics datasets presents unique challenges due to varied feature correlations and technology-specific limitations. To address these challenges, we introduce scMODAL, a deep learning framework tailored for single-cell multi-omics data alignment using feature links. scMODAL integrates datasets with limited known positively correlated features, leveraging neural networks and generative adversarial networks to align cell embeddings and preserve feature topology. Our experiments demonstrate scMODAL's effectiveness in removing unwanted variation, preserving biological information, and accurately identifying cell subpopulations across diverse datasets. scMODAL not only advances integration tasks but also supports downstream analyses such as feature imputation and feature relationship inference, offering a robust solution for advancing single-cell multi-omics research.

Authors

  • Gefei Wang
    Department of Biostatistics, Yale University, New Haven, CT, USA.
  • Jia Zhao
    1 The Nursing College of Zhengzhou University, Zhengzhou 450052, China; 2 Department of Thoracic Surgery, The First Affiliated Hospital of Zhengzhou University, Zhengzhou 450052, China.
  • Yingxin Lin
    Department of Biostatistics, Yale University, New Haven, CT, USA.
  • Tianyu Liu
    Department of Automation, Tsinghua University, Beijing, China.
  • Yize Zhao
    Department of Biostatistics, Yale University, New Haven, CT, USA.
  • Hongyu Zhao
    SJTU-Yale Joint Center for Biostatistics, Shanghai Jiao Tong University, 800 Dong Chuan Road, Shanghai 200240, China; Department of Biostatistics, Yale University, New Haven, CT, USA.