Machine learning based classification of cells into chronological stages using single-cell transcriptomics.

Journal: Scientific reports
PMID:

Abstract

Age-associated deterioration of cellular physiology leads to pathological conditions. The ability to detect premature aging could provide a window for preventive therapies against age-related diseases. However, the techniques for determining cellular age are limited, as they rely on a limited set of histological markers and lack predictive power. Here, we implement GERAS (GEnetic Reference for Age of Single-cell), a machine learning based framework capable of assigning individual cells to chronological stages based on their transcriptomes. GERAS displays greater than 90% accuracy in classifying the chronological stage of zebrafish and human pancreatic cells. The framework demonstrates robustness against biological and technical noise, as evaluated by its performance on independent samplings of single-cells. Additionally, GERAS determines the impact of differences in calorie intake and BMI on the aging of zebrafish and human pancreatic cells, respectively. We further harness the classification ability of GERAS to identify molecular factors that are potentially associated with the aging of beta-cells. We show that one of these factors, junba, is necessary to maintain the proliferative state of juvenile beta-cells. Our results showcase the applicability of a machine learning framework to classify the chronological stage of heterogeneous cell populations, while enabling detection of candidate genes associated with aging.

Authors

  • Sumeet Pal Singh
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany. sumeet_pal.singh@tu-dresden.de.
  • Sharan Janjuha
    Institute of Pharmacology and Toxicology, University of Zurich, Zurich, Switzerland.
  • Samata Chaudhuri
    Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, 01307, Germany.
  • Susanne Reinhardt
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany.
  • Annekathrin Kränkel
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany.
  • Sevina Dietz
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany.
  • Anne Eugster
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany.
  • Halil Bilgin
    Department of Computer Engineering, Abdullah Gül University, Kayseri, 38030, Turkey.
  • Selcuk Korkmaz
    Department of Biostatistics, Faculty of Medicine, Hacettepe University, Sihhiye, Ankara, Turkey.
  • Gökmen Zararsız
    Department of Biostatistics, Erciyes University, Kayseri, 38030, Turkey.
  • Nikolay Ninov
    Center for Molecular and Cellular Bioengineering, TU Dresden, Dresden, 01307, Germany.
  • John E Reid
    MRC Biostatistics Unit, University of Cambridge, Cambridge, CB2 0SR, UK.