Machine learning-optimized targeted detection of alternative splicing.

Journal: Nucleic acids research
PMID:

Abstract

RNA sequencing (RNA-seq) is widely adopted for transcriptome analysis but has inherent biases that hinder the comprehensive detection and quantification of alternative splicing. To address this, we present an efficient targeted RNA-seq method that greatly enriches for splicing-informative junction-spanning reads. Local splicing variation sequencing (LSV-seq) utilizes multiplexed reverse transcription from highly scalable pools of primers anchored near splicing events of interest. Primers are designed using Optimal Prime, a novel machine learning algorithm trained on the performance of thousands of primer sequences. In experimental benchmarks, LSV-seq achieves high on-target capture rates and concordance with RNA-seq, while requiring significantly lower sequencing depth. Leveraging deep learning splicing code predictions, we used LSV-seq to target events with low coverage in GTEx RNA-seq data and newly discover hundreds of tissue-specific splicing events. Our results demonstrate the ability of LSV-seq to quantify splicing of events of interest at high-throughput and with exceptional sensitivity.

Authors

  • Kevin Yang
    Computer Science and Artificial Intelligence Laboratory , MIT , Cambridge , Massachusetts 02139 , United States.
  • Nathaniel Islas
    Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • San Jewell
    Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • Di Wu
    University of Melbourne, Melbourne, VIC 3010 Australia.
  • Anupama Jha
    Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, USA.
  • Caleb M Radens
    Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • Jeffrey A Pleiss
    Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
  • Kristen W Lynch
    Department of Biochemistry and Biophysics, Philadelphia, USA.
  • Yoseph Barash
    Department of Electrical and Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G4, Canada. Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada. School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
  • Peter S Choi
    Department of Pathology & Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA.