Macrocyclization of linear molecules by deep learning to facilitate macrocyclic drug candidates discovery.

Journal: Nature communications
PMID:

Abstract

Interest in macrocycles as potential therapeutic agents has increased rapidly. Macrocyclization of bioactive acyclic molecules provides a potential avenue to yield novel chemical scaffolds, which can contribute to the improvement of the biological activity and physicochemical properties of these molecules. In this study, we propose a computational macrocyclization method based on Transformer architecture (which we name Macformer). Leveraging deep learning, Macformer explores the vast chemical space of macrocyclic analogues of a given acyclic molecule by adding diverse linkers compatible with the acyclic molecule. Macformer can efficiently learn the implicit relationships between acyclic and macrocyclic structures represented as SMILES strings and generate plenty of macrocycles with chemical diversity and structural novelty. In data augmentation scenarios using both internal ChEMBL and external ZINC test datasets, Macformer display excellent performance and generalisability. We showcase the utility of Macformer when combined with molecular docking simulations and wet lab based experimental validation, by applying it to the prospective design of macrocyclic JAK2 inhibitors.

Authors

  • Yanyan Diao
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Dandan Liu
    Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN.
  • Huan Ge
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Rongrong Zhang
    Department of Psychiatry Affiliated Nanjing Brain Hospital, Nanjing Medical University, Nanjing 210029, China.
  • Kexin Jiang
    College of Medicine, Southwest Jiaotong University, Chengdu, China.
  • Runhui Bao
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Xiaoqian Zhu
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Hongjie Bi
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Wenjie Liao
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Ziqi Chen
    Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science & Technology, Shanghai, 200237, China.
  • Kai Zhang
    Anhui Province Key Laboratory of Respiratory Tumor and Infectious Disease, First Affiliated Hospital of Bengbu Medical University, Bengbu, China.
  • Rui Wang
    Department of Clinical Laboratory Medicine Center, Inner Mongolia Autonomous Region People's Hospital, Hohhot, Inner Mongolia, China.
  • Lili Zhu
    Department of Ultrasound, First Affiliated Hospital of Soochow University, Suzhou City, Jiangsu Province, China.
  • Zhenjiang Zhao
    Department of Mathematics, Huzhou Teachers College, Huzhou 313000, China. Electronic address: zhaozjcn@163.com.
  • Qiaoyu Hu
    Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai, 201210, China.
  • Honglin Li
    Innovation Center for AI and Drug Discovery, East China Normal University, China.