CWLy-pred: A novel cell wall lytic enzyme identifier based on an improved MRMD feature selection method.

Journal: Genomics
PMID:

Abstract

Cell wall lytic enzymes play key roles in biochemical, morphological, genetic research and industry fields. To save time and labor costs, bioinformatic methods are usually adopted to narrow the scope of in vitro experimentation. In this paper, we established a novel machine learning (support vector machine) based identifier called CWLy-pred to identify cell wall lytic enzymes. An improved MRMD feature selection method is also proposed to select the optimal training set to avoid data redundancy. CWLy-pred obtains an accuracy of 93.067%, a sensitivity of 85.3%, a specificity of 94.8%, an MCC of 0.775 and an AUC of 0.900. It outperforms the state-of-the-art identifier in terms of accuracy, sensitivity, specificity and MCC. Our proposed model is based on a feature set of only 6 dimensions; therefore, it not only can overcome overfitting problems but can also supervise biological experiments effectively. CWLy-pred is embedded in a web application at http://server.malab.cn/CWLy-pred/index.jsp, which is accessible for free.

Authors

  • Chaolu Meng
    College of Intelligence and Computing, Tianjin University, 300350, Tianjin, China.
  • Jin Wu
    School of Information and Software Engineering, University of Electronic Science and Technology of China, China. Electronic address: wj@uestc.edu.cn.
  • Fei Guo
    School of Electronic Information Engineering, Tianjin University, Tianjin 300072, China. Electronic address: gfjy001@yahoo.com.
  • Benzhi Dong
    Information and Computer Engineering College, Northeast Forestry University, Harbin, China. Electronic address: nefu_dbz@163.com.
  • Lei Xu
    Key Laboratory of Biomedical Information Engineering of the Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China.