Fusing 2D and 3D molecular graphs as unambiguous molecular descriptors for conformational and chiral stereoisomers.

Journal: Briefings in bioinformatics
PMID:

Abstract

The rapid progress of machine learning (ML) in predicting molecular properties enables high-precision predictions being routinely achieved. However, many ML models, such as conventional molecular graph, cannot differentiate stereoisomers of certain types, particularly conformational and chiral ones that share the same bonding connectivity but differ in spatial arrangement. Here, we designed a hybrid molecular graph network, Chemical Feature Fusion Network (CFFN), to address the issue by integrating planar and stereo information of molecules in an interweaved fashion. The three-dimensional (3D, i.e., stereo) modality guarantees precision and completeness by providing unabridged information, while the two-dimensional (2D, i.e., planar) modality brings in chemical intuitions as prior knowledge for guidance. The zipper-like arrangement of 2D and 3D information processing promotes cooperativity between them, and their synergy is the key to our model's success. Experiments on various molecules or conformational datasets including a special newly created chiral molecule dataset comprised of various configurations and conformations demonstrate the superior performance of CFFN. The advantage of CFFN is even more significant in datasets made of small samples. Ablation experiments confirm that fusing 2D and 3D molecular graphs as unambiguous molecular descriptors can not only effectively distinguish molecules and their conformations, but also achieve more accurate and robust prediction of quantum chemical properties.

Authors

  • Wenjie Du
    School of Software Engineering, University of Science and Technology of China, Hefei 230026, China; Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou 215123, China.
  • Xiaoting Yang
    Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, Jiangsu 215123, China.
  • Di Wu
    University of Melbourne, Melbourne, VIC 3010 Australia.
  • FenFen Ma
    Suzhou Laboratory , Suzhou 215123, Jiangsu, China.
  • Baicheng Zhang
    Key Laboratory of Precision and Intelligent Chemistry, School of Chemistry and Materials Science, University of Science and Technology of China, China.
  • Chaochao Bao
    Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, Jiangsu 215123, China.
  • Yaoyuan Huo
    Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou, Jiangsu 215123, China.
  • Jun Jiang
    Key Laboratory of Precision and Intelligent Chemistry, School of Chemistry and Materials Science, University of Science and Technology of China, China.
  • Xin Chen
    University of Nottingham, Nottingham, United Kingdom.
  • Yang Wang
    Department of General Surgery The First People's Hospital of Yunnan Province, The Affiliated Hospital of Kunming University of Science and Technology Kunming China.