Chromatin accessibility prediction via a hybrid deep convolutional neural network.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: A majority of known genetic variants associated with human-inherited diseases lie in non-coding regions that lack adequate interpretation, making it indispensable to systematically discover functional sites at the whole genome level and precisely decipher their implications in a comprehensive manner. Although computational approaches have been complementing high-throughput biological experiments towards the annotation of the human genome, it still remains a big challenge to accurately annotate regulatory elements in the context of a specific cell type via automatic learning of the DNA sequence code from large-scale sequencing data. Indeed, the development of an accurate and interpretable model to learn the DNA sequence signature and further enable the identification of causative genetic variants has become essential in both genomic and genetic studies.

Authors

  • Qiao Liu
    MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST; Department of Automation, Tsinghua University, Beijing 100084, China.
  • Fei Xia
    Department of Electrical Engineering, Stanford University, Stanford, CA 94305, USA.
  • Qijin Yin
    MOE Key Laboratory of Bioinformatics; Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST; Department of Automation, Tsinghua University, Beijing 100084, China.
  • Rui Jiang
    Department of Urology, The Affiliated Hospital of Southwest Medical University, Luzhou, China.