Boosting Granular Support Vector Machines for the Accurate Prediction of Protein-Nucleotide Binding Sites.

Journal: Combinatorial chemistry & high throughput screening
Published Date:

Abstract

AIM AND OBJECTIVE: The accurate identification of protein-ligand binding sites helps elucidate protein function and facilitate the design of new drugs. Machine-learning-based methods have been widely used for the prediction of protein-ligand binding sites. Nevertheless, the severe class imbalance phenomenon, where the number of nonbinding (majority) residues is far greater than that of binding (minority) residues, has a negative impact on the performance of such machine-learning-based predictors.

Authors

  • Yi-Heng Zhu
    School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.
  • Jun Hu
    Jinling Clinical Medical College, Nanjing Medical University,Nanjing,Jiangsu 210002,China.
  • Yong Qi
    School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.
  • Xiao-Ning Song
    School of Internet of Things, Jiangnan University, Wuxi 214122, China.
  • Dong-Jun Yu