A Multi-view Molecular Pre-training with Generative Contrastive Learning.

Journal: Interdisciplinary sciences, computational life sciences
Published Date:

Abstract

Molecular representation learning encodes meaningful molecular structure as embedding vectors, a necessary prerequisite for molecular property prediction. Yet learning to represent molecules accurately remains challenging. Previous end-to-end approaches to learning molecular representations may lose information, and they do not make use of generative molecular representations. To capture richer molecular features, a pre-training model can draw on several molecular representations, reducing the information loss that comes from relying on a single one. We therefore propose MVGC, a multi-view generative contrastive learning pre-training model. The framework learns three fundamental feature representations of molecules and integrates them to predict molecular properties on benchmark datasets. Comprehensive experiments on seven classification tasks and three regression tasks show that MVGC outperforms most state-of-the-art approaches. We also explore the potential of MVGC to learn chemically meaningful molecular representations.
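
The abstract does not state which three representations MVGC uses or what its training objective is. As a rough illustration of the general idea behind multi-view contrastive pre-training, the sketch below computes a generic one-directional InfoNCE loss between embeddings of the same molecules from different views, then averages it over all view pairs. Everything here is an assumption made for illustration, not the authors' method: the function names (`cosine`, `info_nce`, `multi_view_loss`), the temperature value, and the toy embeddings are all hypothetical.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two non-zero vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(view_a, view_b, temperature=0.1):
    """Mean one-directional InfoNCE loss between two views.

    Entry i of view_a and entry i of view_b are assumed to embed the same
    molecule (the positive pair); all other entries of view_b are negatives.
    """
    losses = []
    for i, anchor in enumerate(view_a):
        logits = [cosine(anchor, other) / temperature for other in view_b]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        max_logit = max(logits)
        log_denominator = max_logit + math.log(
            sum(math.exp(logit - max_logit) for logit in logits)
        )
        # Cross-entropy of picking the positive among all candidates.
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


def multi_view_loss(views, temperature=0.1):
    """Average the pairwise loss over every unordered pair of views."""
    pairs = list(combinations(views, 2))
    return sum(info_nce(a, b, temperature) for a, b in pairs) / len(pairs)


# Hypothetical toy embeddings: two molecules, three views, two dimensions.
view_1 = [[1.0, 0.0], [0.0, 1.0]]
view_2 = [[0.9, 0.1], [0.1, 0.9]]  # roughly agrees with view_1
view_3 = [[0.0, 1.0], [1.0, 0.0]]  # deliberately swapped, so misaligned

aligned = info_nce(view_1, view_2)
misaligned = info_nce(view_1, view_3)
combined = multi_view_loss([view_1, view_2, view_3])
```

In this toy setup the loss is small when two views agree on which embeddings belong to the same molecule and large when they do not, which is the signal a contrastive objective uses to pull views of the same molecule together. A symmetric variant would also evaluate `info_nce(view_b, view_a)` and average the two directions.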

Authors

  • Yunwu Liu
    School of Information Science and Engineering, Lanzhou University, 730000, Lanzhou, China. Electronic address: liuyw19@lzu.edu.cn.
  • Ruisheng Zhang
    School of Information Science & Engineering, Lanzhou University, Lanzhou, Gansu 730000, China. zhangrs@lzu.edu.cn.
  • Yongna Yuan
  • Jun Ma
    State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150090, China.
  • Tongfeng Li
    School of Information Science and Engineering, Lanzhou University, 730000, Lanzhou, China; Computer College, Qinghai Normal University, 810016, Xining, China. Electronic address: litf19@lzu.edu.cn.
  • Zhixuan Yu
    School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000, China.