Compound-protein interaction prediction by deep learning: Databases, descriptors and models.

Journal: Drug discovery today
Published Date:

Abstract

The screening of compound-protein interactions (CPIs) is one of the most crucial steps in finding hit and lead compounds. Deep learning (DL) methods for CPI prediction can address intrinsic limitations of traditional HTS and virtual screening with the advantage of low cost and high efficiency. This review provides a comprehensive survey of DL-based CPI prediction. It first summarizes popular databases of small-molecule compounds, proteins and binding complexes. Then, it outlines classical representations of compounds and proteins in turn. After that, this review briefly introduces state-of-the-art DL-based models in terms of design paradigms and investigates their prediction performance. Finally, it indicates current challenges and trends toward better CPI prediction and sketches out crucial approaches toward practical applications.

Authors

  • Bing-Xue Du
    School of Life Sciences, Northwestern Polytechnical University, Xi'an 710072, China.
  • Yuan Qin
    College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China.
  • Yan-Feng Jiang
    School of Life Sciences, Northwestern Polytechnical University, Xi'an 710072, China.
  • Yi Xu
    School of Food and Biological Engineering, Jiangsu University, Zhenjiang 212013, PR China.
  • Siu-Ming Yiu
    2 Department of Computer Science, The University of Hong Kong , Pokfulam, Hong Kong .
  • Hui Yu
    Engineering Technology Research Center of Shanxi Province for Opto-Electric Information and Instrument, Taiyuan 030051, China. 13934603474@nuc.edu.cn.
  • Jian-Yu Shi
    School of Life Sciences, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, China.