DISC: a highly scalable and accurate inference of gene expression and structure for single-cell transcriptomes using semi-supervised deep learning.

Journal: Genome biology
Published Date:

Abstract

Dropouts distort gene expression and misclassify cell types in single-cell transcriptome. Although imputation may improve gene expression and downstream analysis to some degree, it also inevitably introduces false signals. We develop DISC, a novel deep learning network with semi-supervised learning to infer gene structure and expression obscured by dropouts. Compared with seven state-of-the-art imputation approaches on ten real-world datasets, we show that DISC consistently outperforms the other approaches. Its applicability, scalability, and reliability make DISC a promising approach to recover gene expression, enhance gene and cell structures, and improve cell type identification for sparse scRNA-seq data.

Authors

  • Yao He
    School of Environmental Science and Engineering, Institute of Environmental Health and Pollution Control, Guangdong University of Technology, Guangzhou 510006, China.
  • Hao Yuan
    Central Sterile Supply Department, Baoding No. 1 Central Hospital, Baoding, Hebei, China.
  • Cheng Wu
    Department of Automation, Tsinghua University, Beijing 100084, China. Electronic address: wuc@tsinghua.edu.cn.
  • Zhi Xie
    State Key Lab of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou 500040, China. Electronic address: xiezh8@sysu.edu.cn.