Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach.

Journal: BMC genomics
Published Date:

Abstract

BACKGROUND: Exon splicing is a regulated cellular process in the transcription of protein-coding genes. Technological advancements and cost reductions in RNA sequencing have made quantitative and qualitative assessments of the transcriptome both possible and widely available. RNA-seq provides unprecedented resolution to identify gene structures and resolve the diversity of splicing variants. However, currently available ab initio aligners are vulnerable to spurious alignments due to random sequence matches and sample-reference genome discordance. As a consequence, a significant set of false positive exon junction predictions would be introduced, which will further confuse downstream analyses of splice variant discovery and abundance estimation.

Authors

  • Yi Zhang
    Department of Thyroid Surgery, China-Japan Union Hospital of Jilin University, Jilin University, Changchun, China.
  • Xinan Liu
    Department of Computer Science, University of Kentucky, Lexington, KY, 40506, USA.
  • James MacLeod
    Department of Veterinary Science, University of Kentucky, Lexington, KY, 40506, USA.
  • Jinze Liu