siRNADiscovery: a graph neural network for siRNA efficacy prediction via deep RNA sequence analysis.

Journal: Briefings in bioinformatics
PMID:

Abstract

The clinical adoption of small interfering RNAs (siRNAs) has prompted the development of various computational strategies for siRNA design, from traditional data analysis to advanced machine learning techniques. However, previous studies have inadequately considered the full complexity of the siRNA silencing mechanism, neglecting critical elements such as siRNA positioning on mRNA, RNA base-pairing probabilities, and RNA-AGO2 interactions, thereby limiting the insight and accuracy of existing models. Here, we introduce siRNADiscovery, a Graph Neural Network (GNN) framework that leverages both non-empirical and empirical rule-based features of siRNA and mRNA to effectively capture the complex dynamics of gene silencing. On multiple internal datasets, siRNADiscovery achieves state-of-the-art performance. Significantly, siRNADiscovery also outperforms existing methodologies in in vitro studies and on an externally validated dataset. Additionally, we develop a new data-splitting methodology that addresses the data leakage issue, a frequently overlooked problem in previous studies, ensuring the robustness and stability of our model under various experimental settings. Through rigorous testing, siRNADiscovery has demonstrated remarkable predictive accuracy and robustness, making significant contributions to the field of gene silencing. Furthermore, our approach to redefining data-splitting standards aims to set new benchmarks for future research in the domain of predictive biological modeling for siRNA.

Authors

  • Rongzhuo Long
    School of Basic Medicine and Clinical Pharmacy, China Pharmaceutical University, 211198, Nanjing, China.
  • Ziyu Guo
    School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, China.
  • Da Han
    Institute of Molecular Medicine (IMM) and Department of Laboratory Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200240, Shanghai, China.
  • Boxiang Liu
    Institute of Deep Learning, Baidu Research, Sunnyvale, USA. jollier.liu@gmail.com.
  • Xudong Yuan
    ACON Pharmaceuticals, 2557 Route 130 S, Ste 3, Cranbury, NJ 08512, USA.
  • Guangyong Chen
    Shenzhen Institutes of Advanced Technology, Shenzhen 518055, Guangdong, China.
  • Pheng-Ann Heng
  • Liang Zhang