Predicting lysine lipoylation sites using bi-profile bayes feature extraction and fuzzy support vector machine algorithm.

Journal: Analytical biochemistry
Published Date:

Abstract

Lipoylation is a highly conserved post-translational modification which has been found to be involved in many biological processes and closely associated with various metabolic diseases. The accurate identification of lipoylation sites is necessary to elucidate the underlying molecular mechanisms of lipoylation. As the traditional experimental methods are time consuming and expensive, it is desired to develop computational methods to predict lipoylation sites. In this study, a novel predictor named LipoPred is proposed to predict lysine lipoylation sites. On the one hand, an effective feature extraction method, bi-profile bayes encoding, is employed to encode lipoylation sites. On the other hand, a fuzzy support vector machine algorithm is proposed to solve the class imbalance and noise problem in the prediction of lipoylation sites. As illustrated by 10-fold cross-validation, LipoPred achieves an excellent performance with a Matthew's correlation coefficient of 0.9930. Therefore, LipoPred can be a useful bioinformatics tool for the prediction of lipoylation sites. Feature analysis shows that some residues around lipoylation sites may play an important role in the prediction. The results of analysis and prediction could offer useful information for elucidating the molecular mechanisms of lipoylation. A user-friendly web-server for LipoPred is established at 123.206.31.171/LipoPred/.

Authors

  • Zhe Ju
    School of Control Science and Engineering, Dalian University of Technology, #2 Ling-gong Road, Dalian 116024, People׳s Republic of China. Electronic address: juzhe1120@hotmail.com.
  • Shi-Yun Wang
    College of Science, Shenyang Aerospace University, 110136, PR China.