CrosSplice: A Pipeline for Identifying Rare Splice-Site Creating Variants from Cross-Tissue Transcriptome Data

Journal: medRxiv
Published Date:

Abstract

Despite their profound impact on patients’ lives, most rare and intractable diseases still lack established treatments. Genomic variants that disrupt normal splicing by creating novel splice sites (splice-site creating variants, SSCVs) substantially contribute to the pathogenesis of those conditions. Deep intronic SSCVs are particularly amenable to antisense oligonucleotide (ASO)-mediated splice modulation, yet many of them remain undetected by conventional genomic analyses. Existing approaches to identify SSCVs, including splicing quantitative trait loci (sQTL) analyses and machine learning-based methods, each show limitations in sensitivity or accuracy, hindering the comprehensive identification of clinically actionable SSCVs. We developed CrosSplice, a novel pipeline that integrates machine learning–based predictions with statistical association testing to robustly identify SSCVs. By leveraging cross-tissue transcriptome data and aggregating splicing signals across multiple tissues, CrosSplice enables comprehensive detection of SSCVs, including rare and tissue-specific variants often missed by conventional methods. Applying CrosSplice to the GTEx dataset (8,656 transcriptomes of 54 tissues from 479 postmortem donors), we identified 1,743 significant SSCVs, 65% of which were deep intronic. Among these, 185 SSCVs were listed in ClinVar, including five pathogenic or likely pathogenic variants. CrosSplice also discovered a novel deep intronic SSCV in PLA2G6, the gene responsible for infantile neuroaxonal dystrophy. We experimentally confirmed that ASOs successfully corrected the aberrant splicing pattern induced by this variant. CrosSplice substantially extends the detectable landscape of SSCVs by capturing rare and tissue-specific variants, uncovering pathogenic and therapeutically actionable SSCVs that are frequently overlooked by existing methods. The resulting SSCV catalogue provides a platform for systematic discovery of ASO targets and advances opportunities for precision therapies in rare and intractable diseases.

Authors

  • Yuki Yano; Ai Okada; Misaki Ono; Raúl N. Mateos; Tojo Nakayama; Yuichi Shiraishi