HDMC: a novel deep learning-based framework for removing batch effects in single-cell RNA-seq data.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: With the development of single-cell RNA sequencing (scRNA-seq) techniques, increasingly more large-scale gene expression datasets become available. However, to analyze datasets produced by different experiments, batch effects among different datasets must be considered. Although several methods have been recently published to remove batch effects in scRNA-seq data, two problems remain to be challenging and not completely solved: (i) how to reduce the distribution differences of different batches more accurately; and (ii) how to align samples from different batches to recover the cell type clusters.

Authors

  • Xiao Wang
    Research Centre of Basic Integrative Medicine, School of Basic Medical Sciences, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China.
  • Jia Wang
    Institute of Special Animal and Plant Sciences, Chinese Academy of Agricultural Sciences, Changchun, Jilin, China.
  • Han Zhang
    Johns Hopkins University, Baltimore, MD, USA.
  • Shenwei Huang
    College of Computer Science, Nankai University, Tongyan Road, 300350, Tianjin, China.
  • Yanbin Yin
    Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, 1400 R Street, Lincoln, NE, 68588, USA.