A Feature Fusion Predictor for RNA Pseudouridine Sites with Particle Swarm Optimizer Based Feature Selection and Ensemble Learning Approach.

Journal: Current issues in molecular biology
Published Date:

Abstract

RNA pseudouridine modification is particularly important in a variety of cellular biological and physiological processes. It plays a significant role in understanding RNA functions, RNA structure stabilization, translation processes, etc. To understand its functional mechanisms, it is necessary to accurately identify pseudouridine sites in RNA sequences. Although some computational methods have been proposed for the identification of pseudouridine sites, it is still a challenge to improve the identification accuracy and generalization ability. To address this challenge, a novel feature fusion predictor, named PsoEL-PseU, is proposed for the prediction of pseudouridine sites. Firstly, this study systematically and comprehensively explored different types of feature descriptors and determined six feature descriptors with various properties. To improve the feature representation ability, a binary particle swarm optimizer was used to capture the optimal feature subset for six feature descriptors. Secondly, six individual predictors were trained by using the six optimal feature subsets. Finally, to fuse the effects of all six features, six individual predictors were fused into an ensemble predictor by a parallel fusion strategy. Ten-fold cross-validation on three benchmark datasets indicated that the PsoEL-PseU predictor significantly outperformed the current state-of-the-art predictors. Additionally, the new predictor achieved better accuracy in the independent dataset evaluation-accuracy which is significantly higher than that of its existing counterparts-and the user-friendly webserver developed by the PsoEL-PseU predictor has been made freely accessible.

Authors

  • Xiao Wang
    Research Centre of Basic Integrative Medicine, School of Basic Medical Sciences, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China.
  • Xi Lin
    Department of Ultrasound, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, 651 Dongfeng East Road, Guangzhou 510060, China. Electronic address: linxi@sysucc.org.cn.
  • Rong Wang
    College of Food Science and Engineering, Northwest A&F University, Yangling 712100, Shanxi, China. Electronic address: wangrong91@nwsuaf.edu.cn.
  • Nijia Han
    School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.
  • Kaiqi Fan
    School of Material and Chemical Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.
  • Lijun Han
    School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.
  • Zhaoyuan Ding
    School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.