HimGNN: a novel hierarchical molecular graph representation learning framework for property prediction.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Accurate prediction of molecular properties is an important topic in drug discovery. Recent works have developed various representation schemes for molecular structures to capture different chemical information in molecules. The atom and motif can be viewed as hierarchical molecular structures that are widely used for learning molecular representations to predict chemical properties. Previous works have attempted to exploit both atom and motif to address the problem of information loss in single representation learning for various tasks. To further fuse such hierarchical information, the correspondence between learned chemical features from different molecular structures should be considered. Herein, we propose a novel framework for molecular property prediction, called hierarchical molecular graph neural networks (HimGNN). HimGNN learns hierarchical topology representations by applying graph neural networks on atom- and motif-based graphs. In order to boost the representational power of the motif feature, we design a Transformer-based local augmentation module to enrich motif features by introducing heterogeneous atom information in motif representation learning. Besides, we focus on the molecular hierarchical relationship and propose a simple yet effective rescaling module, called contextual self-rescaling, that adaptively recalibrates molecular representations by explicitly modelling interdependencies between atom and motif features. Extensive computational experiments demonstrate that HimGNN can achieve promising performances over state-of-the-art baselines on both classification and regression tasks in molecular property prediction.

Authors

  • Shen Han
    College of Informatics, Huazhong Agricultural University, People's Republic of China.
  • Haitao Fu
    College of Informatics, Huazhong Agricultural University, People's Republic of China.
  • Yuyang Wu
    College of Plant Science and Technology, Huazhong Agricultural University, People's Republic of China.
  • Ganglan Zhao
    College of Informatics, Huazhong Agricultural University, People's Republic of China.
  • Zhenyu Song
    College of Informatics, Huazhong Agricultural University, People's Republic of China.
  • Feng Huang
    Beijing Hospital of TCM, Capital Medical University, Beijing 100010, China; Institution of Acupuncture and Moxibustion, China Academy of Chinese Medical Sciences, Beijing 100700.
  • Zhongfei Zhang
  • Shichao Liu
  • Wen Zhang
    Oil Crops Research Institute, Chinese Academy of Agricultural Sciences Wuhan 430062 China peiwuli@oilcrops.cn zhangqi521x@126.com +86-27-8681-2943 +86-27-8671-1839.