Improving tRNAscan-SE Annotation Results via Ensemble Classifiers.

Journal: Molecular informatics

Published Date: Sep 14, 2015

Abstract

tRNAScan-SE is a tRNA detection program that is widely used for tRNA annotation; however, the false positive rate of tRNAScan-SE is unacceptable for large sequences. Here, we used a machine learning method to try to improve the tRNAScan-SE results. A new predictor, tRNA-Predict, was designed. We obtained real and pseudo-tRNA sequences as training data sets using tRNAScan-SE and constructed three different tRNA feature sets. We then set up an ensemble classifier, LibMutil, to predict tRNAs from the training data. The positive data set of 623 tRNA sequences was obtained from tRNAdb 2009 and the negative data set was the false positive tRNAs predicted by tRNAscan-SE. Our in silico experiments revealed a prediction accuracy rate of 95.1 % for tRNA-Predict using 10-fold cross-validation. tRNA-Predict was developed to distinguish functional tRNAs from pseudo-tRNAs rather than to predict tRNAs from a genome-wide scan. However, tRNA-Predict can work with the output of tRNAscan-SE, which is a genome-wide scanning method, to improve the tRNAscan-SE annotation results. The tRNA-Predict web server is accessible at http://datamining.xmu.edu.cn/∼gjs/tRNA-Predict.

Authors

Quan Zou
Jiasheng Guo

School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
Ying Ju

School of Information Science and Engineering, Xiamen University, Xiamen, China.
Meihong Wu

School of Information Science and Technology, Xiamen University, Xiamen 361005, China.
Xiangxiang Zeng

Department of Computer Science, Hunan University, Changsha, China.
Zhiling Hong

Software School, Xiamen University, Xiamen 361005, China. hongzl@xmu.edu.cn.

Keywords

Databases, Nucleic Acid Molecular Sequence Annotation RNA, Transfer Sequence Analysis, DNA Software

External Resources

View on PubMed Access via DOI PubMed (27491037)

Improving tRNAscan-SE Annotation Results via Ensemble Classifiers.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals