Robust and interpretable prediction of gene markers and cell types from spatial transcriptomics data.

Journal: Nature communications
Published Date:

Abstract

Spatial transcriptomics (ST) links tissue morphology with gene expression values, opening new avenues for digital pathology. Deep learning models are used to predict gene expression or classify cell types directly from images, offering significant clinical potential but still requiring improvements in interpretability and robustness. We present STimage as a comprehensive suite of models to predict spatial gene expression and classify cell types directly from standard H&E images. STimage enhances robustness by estimating gene expression distributions and quantifying both data-driven (aleatoric) and model-based (epistemic) uncertainty using an ensemble approach with foundation models. Interpretability is achieved through attribution analysis at single-cell resolution integrated with histopathological annotations, functional genes, and latent representations. We validated STimage across diverse datasets, demonstrating its performance across various platforms. STimage-predicted gene expression can stratify patient survival and predict drug response. By enabling molecular and cellular prediction from routine histology, STimage offers a powerful tool to advance digital pathology.

Authors

  • Xiao Tan
    College of Food Science and Engineering, Northwest University, Xi'an 710069, China; School of Chemistry & Chemical Engineering, Yulin University, Yulin 719000, China.
  • Onkar Mulay
    Genomics and Machine Learning Lab, Institute for Molecular Bioscience, St Lucia, QLD, Australia.
  • Jacky Xie
    School of Mathematics and Physics, The University of Queensland, St Lucia, QLD, Australia.
  • Samual MacDonald
    Max Kelsen, Brisbane, QLD, 4006, Australia.
  • Taehyun Kim
    Department of Neuropsychiatry, Seoul National University Bundang Hospital, 82 Gumi-ro 173beon-gil, Bundang-gu, Gyeonggi, 13620, Korea.
  • Chenhao Zhou
    Frazer Institute, Faculty of Health, Medicine and Behavioural Sciences, The University of Queensland, Woolloongabba, QLD, Australia.
  • Zherui Xiong
    Genomics and Machine Learning Lab, Institute for Molecular Bioscience, St Lucia, QLD, Australia.
  • Samuel X Tan
    Dermatology Research Centre, Frazer Institute, The University of Queensland, Brisbane, Australia.
  • Nan Ye
    School of Mathematics and Physics, The University of Queensland, Brisbane, Australia.
  • Amy McCart Reed
    UQ Centre for Clinical Research, Faculty of Health, Medicine and Behavioural Sciences, The University of Queensland, Herston, QLD, Australia.
  • Kiarash Khosrotehrani
    Dermatology Research Centre, Frazer Institute, The University of Queensland, Brisbane, Australia.
  • Fred Roosta
    ARC Training Centre for Information Resilience (CIRES), Brisbane, Australia.
  • Maciej Trzaskowski
    Max Kelsen, Brisbane, QLD, 4006, Australia. [email protected].
  • Peter T Simpson
    UQ Centre for Clinical Research, Faculty of Health, Medicine and Behavioural Sciences, The University of Queensland, Herston, QLD, Australia.
  • Quan Nguyen
    Division of Genetics and Genomics, Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia. [email protected].

Keywords

No keywords available for this article.