pertTF: context-aware AI modeling for genome-scale and cross-system perturbation prediction
Journal:
bioRxiv
Published Date:
Mar 16, 2026
Abstract
Predicting genetic perturbation responses at a single-cell level is central to building models for cell state and disease. However, existing approaches are limited on predicting phenotypic outcomes beyond expression changes and generalizing predictions across genome-scale perturbations in biologically relevant contexts. Here we introduce pertTF, a transformer-based single-cell genetic perturbation model. pertTF was trained from a unique dataset capturing single cell expressions profiles of 30 full gene knockouts across 14 relevant cell types during human pancreatic development and beta-cell differentiation. pertTF outperforms current methods in predicting expression changes of perturbing unseen genes in unseen cellular contexts. In addition, pertTF infers perturbation-induced shifts in cell identity and population composition, an important phenotypic outcome of perturbation in many physiology and disease settings. Through transfer learning, pertTF operates in physiologically relevant systems, including primary human islets, where large-scale perturbation experiments are challenging. The generalizability of pertTF is further demonstrated by in silico pooled and single-cell CRISPR screens, capturing critical regulators of stem cells and early pancreatic cell development. These results establish pertTF as a framework for integrating large-scale single-cell perturbation data with AI models to predict genetic perturbation effects across cellular systems and disease contexts.