SVM-LncRNAPro: An SVM-Based Method for Predicting Long Noncoding RNA Promoters.
Journal:
IET systems biology
PMID:
40188358
Abstract
Long non-coding RNAs (lncRNAs) are closely associated with the regulation of gene expression, whose promoters play a crucial role in comprehensively understanding lncRNA regulatory mechanisms, functions and their roles in diseases. Due to limitations of the current techniques, accurately identifying lncRNA promoters remains a challenge. To address this challenge, we propose a support vector machine (SVM)-based method for predicting lncRNA promoters, called SVM-LncRNAPro. This method uses position-specific trinucleotide propensity based on single-strand (PSTNPss) to encode the DNA sequences and employs an SVM as the learning algorithm. The SVM-LncRNAPro achieves state-of-the-art performance with reduced complexity. Additionally, experiments demonstrate that this method exhibits a strong generalisation ability. For the convenience of academic research, we have made the source code of SVM-LncRNAPro publicly available. Researchers can download the code and perform the prediction of the lncRNA promoter via the following link: https://github.com/TG0F7/Prom/tree/master.