Segmentation aware probabilistic phenotyping of single-cell spatial protein expression data.

Journal: Nature communications
PMID:

Abstract

Spatial protein expression technologies can map cellular content and organization by simultaneously quantifying the expression of >40 proteins at subcellular resolution within intact tissue sections and cell lines. However, necessary image segmentation to single cells is challenging and error prone, easily confounding the interpretation of cellular phenotypes and cell clusters. To address these limitations, we present STARLING, a probabilistic machine learning model designed to quantify cell populations from spatial protein expression data while accounting for segmentation errors. To evaluate performance, we develop a comprehensive benchmarking workflow by generating highly multiplexed imaging data of cell line pellet standards with controlled cell content and marker expression and additionally established a score to quantify the biological plausibility of discovered cellular phenotypes on patient-derived tissue sections. Moreover, we generate spatial expression data of the human tonsil-a densely packed tissue prone to segmentation errors-and demonstrate cellular states captured by STARLING identify known cell types not visible with other methods and enable quantification of intra- and inter- individual heterogeneity.

Authors

  • Yuju Lee
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Edward L Y Chen
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Darren C H Chan
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Anuroopa Dinesh
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Somaieh Afiuni-Zadeh
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Conor Klamann
    Data Sciences Institute, University of Toronto, Toronto, ON, Canada.
  • Alina Selega
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
  • Miralem Mrkonjic
    Sinai Health System, Toronto, ON, Canada.
  • Hartland W Jackson
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada; Ontario Institute of Cancer Research, Toronto, ON M5G 1M1, Canada.
  • Kieran R Campbell
    Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON M5G 1X5, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada; Department of Statistical Sciences, University of Toronto, Toronto, ON M5S 3G3, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5T 3A1, Canada; Ontario Institute of Cancer Research, Toronto, ON M5G 1M1, Canada; Vector Institute, Toronto, ON M5G 1M1, Canada. Electronic address: kierancampbell@lunenfeld.ca.