FKRR-MVSF: A Fuzzy Kernel Ridge Regression Model for Identifying DNA-Binding Proteins by Multi-View Sequence Features via Chou's Five-Step Rule.

Journal: International journal of molecular sciences
Published Date:

Abstract

DNA-binding proteins play an important role in cell metabolism. In biological laboratories, the detection methods of DNA-binding proteins includes yeast one-hybrid methods, bacterial singles and X-ray crystallography methods and others, but these methods involve a lot of labor, material and time. In recent years, many computation-based approachs have been proposed to detect DNA-binding proteins. In this paper, a machine learning-based method, which is called the Fuzzy Kernel Ridge Regression model based on Multi-View Sequence Features (FKRR-MVSF), is proposed to identifying DNA-binding proteins. First of all, multi-view sequence features are extracted from protein sequences. Next, a Multiple Kernel Learning (MKL) algorithm is employed to combine multiple features. Finally, a Fuzzy Kernel Ridge Regression (FKRR) model is built to detect DNA-binding proteins. Compared with other methods, our model achieves good results. Our method obtains an accuracy of 83.26% and 81.72% on two benchmark datasets (PDB1075 and compared with PDB186), respectively.

Authors

  • Yi Zou
    Department of Cardiology, Beijing Anzhen Hospital, Capital Medical University, Beijing, China.
  • Yijie Ding
    School of Computer Science and Technology, Tianjin University, Tianjin 300350, China. wuxi_dyj@tju.edu.cn.
  • Jijun Tang
    School of Computer Science and Engineering, Tianjin University, Tianjin, 300072, China. jtang@cse.sc.edu.
  • Fei Guo
    School of Electronic Information Engineering, Tianjin University, Tianjin 300072, China. Electronic address: gfjy001@yahoo.com.
  • Li Peng
    School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China. pengli@jiangnan.edu.cn.