GR-m6A: Prediction of N6-methyladenosine sites in mammals with molecular graph and residual network.
Journal:
Computers in biology and medicine
Published Date:
Jun 30, 2023
Abstract
RNA N6-methyladenine (m6A), which is produced by the methylation of the N6 position of eukaryotic adenine, is a relatively common post-transcriptional modification on the surface of the molecule, which frequently plays a crucial role in biological processes. Biological experimental methods to identify m6A have been studied and implemented in recent years, but they cannot be promoted widely due to drawbacks such as the time and cost of reagents and equipment. Therefore, researchers have proposed computational strategies for identifying m6A sites, but these strategies do not account for the mechanism of methylation occurrence or the structure of RNA molecules. This study, therefore, proposed a novel deep learning model for predicting m6A sites, GR-m6A, which predicts m6A sites by extracting features from the physicochemical properties and spatial structure of molecules via residual networks. In GR-m6A, each RNA base string is represented by SMILES as two matrices comprising topology structural information and node attributes with molecular physicochemical characteristics. The feature encoding matrix was then obtained by fusing the topology matrix and the node matrix in accordance with the graphical convolutional network principle. Correspondingly, the more discriminative features were extracted from the encoding matrix using the residual neural network and predicted using a multilayer perceptron. As evident from the 5-fold cross-validation and independent validation, the GR-m6A model outperformed other existing methods. Thus, we hope that GR-m6A can aid researchers in predicting mammalian m6A loci. The source code and database are available at https://github.com/YingLiangjxau/GR-m6A.