MAHyNet: Parallel Hybrid Network for RNA-Protein Binding Sites Prediction Based on Multi-Head Attention and Expectation Pooling.

Journal: IEEE/ACM transactions on computational biology and bioinformatics
PMID:

Abstract

RNA-binding proteins (RBPs) can regulate biological functions by interacting with specific RNAs, and play an important role in many life activities. Therefore, the rapid identification of RNA-protein binding sites is crucial for functional annotation and site-directed mutagenesis. In this work, a new parallel network that integrates the multi-head attention mechanism and the expectation pooling is proposed, named MAHyNet. The left-branch network of MAHyNet hybrids convolutional neural networks (CNNs) and gated recurrent neural network (GRU) to extract the features of one-hot. The right-branch network is a two-layer CNN network to analyze physicochemical properties of RNA base. Specifically, the multi-head attention mechanism is a computational collection of multiple independent layers of attention, which can extract feature information from multiple dimensions. The expectation pooling combines probabilistic thinking with global pooling. This approach helps to reduce model parameters and enhance the model performance. The combination of CNN and GRU enables further extraction of high-level features in sequences. In addition, the study shows that appropriate hyperparameters have a positive impact on the model performance. Physicochemical properties can be used to supplement characterization information to improving model performance. The experimental results show that MAHyNet has better performance than other models.

Authors

  • Wei Wang
    State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Macau 999078, China.
  • Zhenxi Sun
  • Dong Liu
    Department of Gastrointestinal Surgery, The Third Hospital of Hebei Medical University, Shijiazhuang, China.
  • Hongjun Zhang
    Ministry of Agriculture, Institute for the Control of Agrochemicals, No. 22 Maizidian Street, Beijing 110000, China.
  • Juntao Li
    College of Mathematics and Information Science, Henan Normal University, Xinxiang, 453007, China. Electronic address: juntaolimail@126.com.
  • Xianfang Wang
    School of Computer and Information Engineering, Henan Normal University, Xinxiang, 453007, People's Republic of China. 2wangfang@163.com.
  • Yun Zhou
    MOE Key Lab of Environmental and Occupational Health, School of Public Health, Tongji Medical College, Huazhong University of Science & Technology, Wuhan 430030, China.