Retro Drug Design: From Target Properties to Molecular Structures.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

To deliver more therapeutics to more patients more quickly and economically is the ultimate goal of pharmaceutical researchers. The advent and rapid development of artificial intelligence (AI), in combination with other powerful computational methods in drug discovery, makes this goal more practical than ever before. Here, we describe a new strategy, retro drug design, or RDD, to create novel small-molecule drugs from scratch to meet multiple predefined requirements, including biological activity against a drug target and optimal range of physicochemical and ADMET properties. The molecular structure was represented by an atom typing based molecular descriptor system, optATP, which was further transformed to the space of loading vectors from principal component analysis. Traditional predictive models were trained over experimental data for the target properties using optATP and shallow machine learning methods. The Monte Carlo sampling algorithm was then utilized to find the solutions in the space of loading vectors that have the target properties. Finally, a deep learning model was employed to decode molecular structures from the solutions. To test the feasibility of the algorithm, we challenged RDD to generate novel kinase inhibitors from random numbers with five different ADMET properties optimized at the same time. The best Tanimoto similarity score between the generated valid structures and the available 4,314 kinase inhibitors was < 0.50, indicating a high extent of novelty of the generated compounds. From the 3,040 structures that met all six target properties, 20 were selected for synthesis and experimental measurement of inhibition activity over 97 representative kinases and the ADMET properties. Fifteen and eight compounds were determined to be hits or strong hits, respectively. Five of the six strong kinase inhibitors have excellent experimental ADMET properties. The results presented in this paper illustrate that RDD has the potential to significantly improve the current drug discovery process.

Authors

  • Yuhong Wang
    Wuhan Institute for Food and Cosmetic Control, Wuhan 430014, China.
  • Sam Michael
    National Center for Advancing Translational Sciences (NCATS), Division of Preclinical Innovation (DPI), Stem Cell Translation Laboratory (SCTL), National Institutes of Health (NIH), 9800 Medical Center Drive, Rockville, MD 20850, USA.
  • Shyh-Ming Yang
    National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States.
  • Ruili Huang
    National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.
  • Kennie Cruz-Gutierrez
    National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States.
  • Yaqing Zhang
    Department of Orthopedics, Xinqiao Hospital, Army Medical University, Chongqing 400037, P.R. China.
  • Jinghua Zhao
    Hubei Key Laboratory of Material Chemistry and Service Failure, Hubei Engineering Research Center for Biomaterials and Medical Protective Materials, School of Chemistry and Chemical Engineering, Huazhong University of Science and Technology, Wuhan, Hubei 430074, People's Republic of China.
  • Menghang Xia
    Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, Maryland 20850, United States.
  • Paul Shinn
    National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States.
  • Hongmao Sun
    National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, Bethesda, MD 20892, USA. Electronic address: sunh7@mail.nih.gov.