SVM-LncRNAPro: An SVM-Based Method for Predicting Long Noncoding RNA Promoters.

Journal: IET systems biology
PMID:

Abstract

Long non-coding RNAs (lncRNAs) are closely associated with the regulation of gene expression, whose promoters play a crucial role in comprehensively understanding lncRNA regulatory mechanisms, functions and their roles in diseases. Due to limitations of the current techniques, accurately identifying lncRNA promoters remains a challenge. To address this challenge, we propose a support vector machine (SVM)-based method for predicting lncRNA promoters, called SVM-LncRNAPro. This method uses position-specific trinucleotide propensity based on single-strand (PSTNPss) to encode the DNA sequences and employs an SVM as the learning algorithm. The SVM-LncRNAPro achieves state-of-the-art performance with reduced complexity. Additionally, experiments demonstrate that this method exhibits a strong generalisation ability. For the convenience of academic research, we have made the source code of SVM-LncRNAPro publicly available. Researchers can download the code and perform the prediction of the lncRNA promoter via the following link: https://github.com/TG0F7/Prom/tree/master.

Authors

  • Guohua Huang
    Provincial Key Laboratory of Informational Service for Rural Area of Southwestern Hunan, Shaoyang University, Shaoyang, China.
  • Taigan Xue
    College of Information Science and Engineering, Shaoyang University, Shaoyang, China.
  • Weihong Chen
    MOE Key Lab of Environmental and Occupational Health, School of Public Health, Tongji Medical College, Huazhong University of Science & Technology, Wuhan 430030, China; E-mail: wchen@mails.tjmu.edu.cn.
  • Liangliang Huang
    Software College, Northeastern University, Shenyang 110819, China.
  • Qi Dai
    The First Clinical Medical College, Guangxi University of Chinese Medicine, Nanning 530001, China.
  • Jinyun Jiang
    College of Information Science and Engineering, Shaoyang University, Shaoyang, Hunan 422000, China.