Knowledge-aware contrastive heterogeneous molecular graph learning.

Journal: PLoS computational biology
PMID:

Abstract

Molecular representation learning is pivotal in predicting molecular properties and advancing drug design. Traditional methodologies, which predominantly rely on homogeneous graph encoding, are limited by their inability to integrate external knowledge and represent molecular structures across different levels of granularity. To address these limitations, we propose a paradigm shift by encoding molecular graphs into heterogeneous structures, introducing a novel framework: Knowledge-aware Contrastive Heterogeneous Molecular Graph Learning. This approach leverages contrastive learning to enrich molecular representations with embedded external knowledge. KCHML conceptualizes molecules through three distinct graph views-molecular, elemental, and pharmacological-enhanced by heterogeneous molecular graphs and a dual message-passing mechanism. This design offers a comprehensive representation for property prediction, as well as for downstream tasks such as drug-drug interaction prediction. Extensive benchmarking demonstrates KCHML's superiority over state-of-the-art molecular property prediction models, underscoring its ability to capture intricate molecular features.

Authors

  • Mukun Chen
    School of Computer Science, Wuhan University, Luojiashan Road, Wuchang District., Wuhan, 430072, Hubei Province, China. Electronic address: cmk0910@whu.edu.cn.
  • Jia Wu
  • Shirui Pan
    Faculty of Information Technology, Monash University, Clayton, Australia.
  • Fu Lin
    School of Pharmaceutical Sciences , Wenzhou Medical University , Wenzhou , Zhejiang 325035 , P. R. China.
  • Bo Du
    School of Computer Science, Wuhan University, Wuhan, 430072, China. Electronic address: remoteking@whu.edu.cn.
  • Xiuwen Gong
    University of Technology Sydney, 15 Broadway Ultimo, NSW 2007, Sydney, 2007, Australia. Electronic address: gongxiuwen@gmail.com.
  • Wenbin Hu