Semisupervised Contrastive Learning for Bioactivity Prediction Using Cell Painting Image Data.

Journal: Journal of chemical information and modeling
PMID:

Abstract

Morphological profiling has recently demonstrated remarkable potential for identifying the biological activities of small molecules. Alongside the fully supervised and self-supervised machine learning methods recently proposed for bioactivity prediction from Cell Painting image data, we introduce here a semisupervised contrastive (SemiSupCon) learning approach. This approach combines the strengths of using biological annotations in supervised contrastive learning and leveraging large unannotated image data sets with self-supervised contrastive learning. SemiSupCon enhances downstream prediction performance of classifying MeSH pharmacological classifications from PubChem, as well as mode of action and biological target annotations from the Drug Repurposing Hub across two publicly available Cell Painting data sets. Notably, our approach has effectively predicted the biological activities of several unannotated compounds, and these findings were validated through literature searches. This demonstrates that our approach can potentially expedite the exploration of biological activity based on Cell Painting image data with minimal human intervention.

Authors

  • David Bushiri Pwesombo
    Research Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany.
  • Carsten Beese
    Research Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany.
  • Christopher Schmied
    Research Unit Structural Chemistry and Computational Biophysics, Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin 13125, Germany.
  • Han Sun
    Division of Nephrology,Departmentof Geriatrics, The First Affiliated Hospital of Nanjing Medical University,300 Guangzhou Road, Nanjing, Jiangsu 210029, China.