scGrapHiC: deep learning-based graph deconvolution for Hi-C using single cell gene expression.

Journal: Bioinformatics (Oxford, England)
PMID:

Abstract

SUMMARY: Single-cell Hi-C (scHi-C) protocol helps identify cell-type-specific chromatin interactions and sheds light on cell differentiation and disease progression. Despite providing crucial insights, scHi-C data is often underutilized due to the high cost and the complexity of the experimental protocol. We present a deep learning framework, scGrapHiC, that predicts pseudo-bulk scHi-C contact maps using pseudo-bulk scRNA-seq data. Specifically, scGrapHiC performs graph deconvolution to extract genome-wide single-cell interactions from a bulk Hi-C contact map using scRNA-seq as a guiding signal. Our evaluations show that scGrapHiC, trained on seven cell-type co-assay datasets, outperforms typical sequence encoder approaches. For example, scGrapHiC achieves a substantial improvement of 23.2% in recovering cell-type-specific Topologically Associating Domains over the baselines. It also generalizes to unseen embryo and brain tissue samples. scGrapHiC is a novel method to generate cell-type-specific scHi-C contact maps using widely available genomic signals that enables the study of cell-type-specific chromatin interactions.

Authors

  • Ghulam Murtaza
    Department of Information Systems, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia.
  • Byron Butaney
    Department of Computer Science, Brown University, 115 Waterman Street, Providence, RI, 02912, United States.
  • Justin Wagner
    Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA.
  • Ritambhara Singh