Causal machine learning for single-cell genomics.

Journal: Nature genetics
PMID:

Abstract

Advances in single-cell '-omics' allow unprecedented insights into the transcriptional profiles of individual cells and, when combined with large-scale perturbation screens, enable measuring of the effect of targeted perturbations on the whole transcriptome. These advances provide an opportunity to better understand the causative role of genes in complex biological processes. In this Perspective, we delineate the application of causal machine learning to single-cell genomics and its associated challenges. We first present the causal model that is most commonly applied to single-cell biology and then identify and discuss potential approaches to three open problems: the lack of generalization of models to novel experimental conditions, the complexity of interpreting learned models, and the difficulty of learning cell dynamics.

Authors

  • Alejandro Tejada-Lapuerta
    Institute of Computational Biology, Helmholtz Munich, Munich, Germany.
  • Paul Bertin
    Mila, the Quebec AI Institute, Montreal, Quebec, Canada.
  • Stefan Bauer
    Department of Computer Science, ETH Zurich, Zürich, Switzerland.
  • Hananeh Aliee
    Wellcome Sanger Institute, Hinxton, UK. ha10@sanger.ac.uk.
  • Yoshua Bengio
    Université de Montréal, Montréal QC H3T 1N8, Canada.
  • Fabian J Theis
    Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich, Germany.