Cross-Modal Graph Contrastive Learning with Cellular Images.

Journal: Advanced science (Weinheim, Baden-Wurttemberg, Germany)
Published Date:

Abstract

Constructing discriminative representations of molecules lies at the core of a number of domains such as drug discovery, chemistry, and medicine. State-of-the-art methods employ graph neural networks and self-supervised learning (SSL) to learn unlabeled data for structural representations, which can then be fine-tuned for downstream tasks. Albeit powerful, these methods are pre-trained solely on molecular structures and thus often struggle with tasks involved in intricate biological processes. Here, it is proposed to assist the learning of molecular representation by using the perturbed high-content cell microscopy images at the phenotypic level. To incorporate the cross-modal pre-training, a unified framework is constructed to align them through multiple types of contrastive loss functions, which is proven effective in the formulated novel tasks to retrieve the molecules and corresponding images mutually. More importantly, the model can infer functional molecules according to cellular images generated by genetic perturbations. In parallel, the proposed model can transfer non-trivially to molecular property predictions, and has shown great improvement over clinical outcome predictions. These results suggest that such cross-modality learning can bridge molecules and phenotype to play important roles in drug discovery.

Authors

  • Shuangjia Zheng
    Research Center for Drug Discovery, School of Pharmaceutical Sciences , Sun Yat-sen University , 132 East Circle at University City , Guangzhou 510006 , China.
  • Jiahua Rao
    School of Data and Computer Science , Sun Yat-sen University , Guangzhou 510006 , China.
  • Jixian Zhang
    Aladdin Healthcare Technologies Ltd.
  • Lianyu Zhou
    School of Informatics, Xiamen University, Xiamen, 361005, China.
  • Jiancong Xie
    School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510000, China.
  • Ethan Cohen
    IBENS, Ecole Normale Supérieure, PSL Research Institute, Paris, France.
  • Wei Lu
    Department of Pharmacy, Taihe Hospital, Hubei University of Medicine, Shiyan, China.
  • Chengtao Li
    School of Environmental Science and Engineering, Shaanxi University of Science and Technology, Xi'an 170021, China.
  • Yuedong Yang
    Institute for Glycomics and School of Information and Communication Technique, Griffith University, Parklands Dr. Southport, QLD 4222, Australia.