FactVAE: a factorized variational autoencoder for single-cell multi-omics data integration analysis.

Journal: Briefings in bioinformatics
PMID:

Abstract

Single-cell multi-omics technologies have revolutionized the study of cell states and functions by simultaneously profiling multiple molecular layers within individual cells. However, existing methods for integrating these data struggle to preserve critical feature information and fail to exploit known regulatory knowledge, which is essential for understanding cell functions. This limitation hinders their ability to provide comprehensive and accurate insights into cells. Here, we propose FactVAE, an innovative factorized variational autoencoder designed for the robust and accurate understanding of single-cell multi-omics data. FactVAE integrates the factorization principle into the variational autoencoder framework, ensuring the preservation of feature information while leveraging the non-linear capture of sample information by neural networks. Additionally, known regulatory knowledge is incorporated during model training, and a knowledge transfer strategy is employed for cell embedding optimization and data augmentation. Comparative analyses of single-cell multi-omics datasets from different protocols and the spatial multi-omics dataset demonstrate that FactVAE not only outperforms benchmark methods in clustering performance but also generates augmented data that reveals the clearest cell-type-specific motif expression. Moreover, the feature embeddings captured by FactVAE enable the inference of potential and reliable gene regulatory relationships. Overall, FactVAE's superior performance and strong scalability make it a promising new solution for single-cell multi-omics data analysis.

Authors

  • Linjie Wang
    School of Computer Science and Engineering, Northeastern University, Shenyang, China.
  • Huixia Zhang
    School of Computer Science and Engineering, Northeastern University, 110819, Shenyang, China.
  • Bo Yi
    1 Department of General Surgery, Third Xiangya Hospital, Central South University , Changsha, China .
  • Weidong Xie
    School of Computer Science and Engineering, Northeastern University, Shenyang, China.
  • Kun Yu
    College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, Liaoning 110819, China.
  • Wei Li
    Department of Nephrology, The Second Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.
  • Keqin Li
  • Dazhe Zhao
    Medical Image Computing Laboratory of Ministry of Education, Northeastern University, 110819, Shenyang, China.