De Novo Drug Design by Multi-Objective Path Consistency Learning With Beam A Search.

Journal: IEEE/ACM transactions on computational biology and bioinformatics
Published Date:

Abstract

Generating high-quality and drug-like molecules from scratch within the expansive chemical space presents a significant challenge in the field of drug discovery. In prior research, value-based reinforcement learning algorithms have been employed to generate molecules with multiple desired properties iteratively. The immediate reward was defined as the evaluation of intermediate-state molecules at each step, and the learning objective would be maximizing the expected cumulative evaluation scores for all molecules along the generative path. However, this definition of the reward was misleading, as in reality, the optimization target should be the evaluation score of only the final generated molecule. Furthermore, in previous works, randomness was introduced into the decision-making process, enabling the generation of diverse molecules but no longer pursuing the maximum future rewards. In this paper, immediate reward is defined as the improvement achieved through the modification of the molecule to maximize the evaluation score of the final generated molecule exclusively. Originating from the A search, path consistency (PC), i.e., values on one optimal path should be identical, is employed as the objective function in the update of the value estimator to train a multi-objective de novo drug designer. By incorporating the value into the decision-making process of beam search, the DrugBA algorithm is proposed to enable the large-scale generation of molecules that exhibit both high quality and diversity. Experimental results demonstrate a substantial enhancement over the state-of-the-art algorithm QADD in multiple molecular properties of the generated molecules.

Authors

  • Dengwei Zhao
  • Jingyuan Zhou
    Department of Computer Science and Engineering, Shanghai Jiao Tong University, No. 800 Dong Chuan Road, Shanghai 200240, China.
  • Shikui Tu
    Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China. Electronic address: tushikui@sjtu.edu.cn.
  • Lei Xu
    Key Laboratory of Biomedical Information Engineering of the Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China.