A Hyperbolic Discrete Diffusion 3D RNA Inverse Folding Model for Functional RNA Design.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Generative design of functional RNAs presents revolutionary opportunities for diverse RNA-based biotechnologies and biomedical applications. To this end, RNA inverse folding is a promising strategy for generatively designing new RNA sequences that can fold into desired topological structures. However, three-dimensional (3D) RNA inverse folding remains highly challenging due to limited availability of experimentally derived 3D structural data and unique characteristics of RNA 3D structures. In this study, we propose RIdiffusion, a hyperbolic denoising diffusion generative RNA inverse folding model, for 3D RNA design tasks. By embedding geometric features of RNA 3D structures and topological properties into hyperbolic space, RIdiffusion efficiently recovers the distribution of nucleotides for targeted RNA 3D structures based on limited training samples using a discrete diffusion model. We perform extensive evaluations on RIdiffusion using different data sets and strict data-splitting strategies and the results demonstrate that RIdiffusion consistently outperforms baseline generative models for RNA inverse folding. This study introduces RIdiffusion as a powerful tool for the generative design of functional RNAs, even in structure-data-scarce scenarios. By leveraging geometric deep learning, RIdiffusion enhances performance and holds promise for diverse downstream applications.

Authors

  • Dongyue Hou
    School of Pharmaceutical Sciences, Shanghai Engineering Research Center of Immunotherapeutics, Fudan University, Shanghai 201203, China.
  • Shuai Zhang
    School of Information, Zhejiang University of Finance and Economics, Hangzhou, China.
  • Mengyao Ma
    Byterna Therapeutics, Shanghai 201203, China.
  • Hanbo Lin
    School of Pharmaceutical Sciences, Shanghai Engineering Research Center of Immunotherapeutics, Fudan University, Shanghai 201203, China.
  • Zheng Wan
    Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, Shanghai Frontiers Science Center of Molecule Intelligent Syntheses, School of Chemistry and Molecular Engineering, East China Normal University, 500 Dongchuan Road, Shanghai 200062, China.
  • Hui Zhao
    School of Mathematics and Computer Science, Shaanxi University of Technology, Hanzhong, 723000, Shaanxi, China.
  • Ruian Zhou
    Faculty of Sciences, Engineering and Technology, The University of Adelaide, Adelaide, South Australia 5070, Australia.
  • Xiao He
    Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland. xiao.he@bsse.ethz.ch.
  • Xian Wei
    MoE Engineering Research Center of Hardware/Software Co-Design Technology and Application, East China Normal University, Zhongshan North Road 3663, Shanghai 200062, China.
  • Dianwen Ju
    School of Pharmaceutical Sciences, Shanghai Engineering Research Center of Immunotherapeutics, Fudan University, Shanghai 201203, China.
  • Xian Zeng
    The College of Biomedical Engineering and Instrument Science, Zhejiang University, 310027 Hangzhou, Zhejiang, China.