VAETracer: Mutation-Guided Lineage Reconstruction and Generational State Inference from scRNA-seq

Journal: bioRxiv
Published Date:

Abstract

Somatic mutations accumulate with cell division and are key to understanding tumor evolution. While single-cell RNA sequencing (scRNA-seq) can effectively capture somatic mutations in the 3' untranslated region (3' UTR), enabling its use for lineage tracing, these data are inherently noisy and exhibit high false-positive rates. To address this limitation, we propose VAETracer, a deep learning framework that is able to reconstruct cellular lineages by extracting cellular generation index (CGI) from mutation profiles of 3' UTR in scRNA-seq data, thus enabling the inference of developmental trajectories without relying on noisy mutation signals (https://github.com/Kaiyu-W/VAETracer). There are two core components for VAETracer: (1) scMut module, which infers the CGI directly from sparse and noisy mutation matrices using our cumulative mutation model (CMM), effectively bypassing error-prone phylogenetic tree topologies caused by mutation noise, and (2) MutTracer module, which predicts ancestral and future latent cellular states or gene expressions at the single-cell level by integrating CGI information with measured transcriptomes. Validation on simulated and real tumor datasets shows that this method can accurately reconstruct clonal relationships, quantify tumor progression, and effectively infer unmeasured cellular states using only the measured scRNA-seq data. This work provides a new computational tool for extracting lineage information and revealing tumor evolution directly from widely available RNA-seq data.

Authors

  • Pan
  • L.; Wang
  • K.; Chen
  • L.

Categories