Conditional similarity triplets enable covariate-informed representations of single-cell data.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Single-cell technologies enable comprehensive profiling of diverse immune cell-types through the measurement of multiple genes or proteins per individual cell. In order to translate immune signatures assayed from blood or tissue into powerful diagnostics, machine learning approaches are often employed to compute immunological summaries or per-sample featurizations, which can be used as inputs to models for outcomes of interest. Current supervised learning approaches for computing per-sample representations are trained only to accurately predict a single outcome and do not take into account relevant additional clinical features or covariates that are likely to also be measured for each sample.

Authors

  • Chi-Jane Chen
    Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA. chijane@cs.unc.edu.
  • Haidong Yi
    Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA.
  • Natalie Stanley
    Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.