Additive baselines furnish no evidence for epistasis learning by MULTI-evolve

Journal: bioRxiv
Published Date:

Abstract

Recent work from Tran et al. (Science, 2026) introduced MULTI-evolve, a framework for protein engineering that combines single-mutant nomination via a protein language model (PLM) or a deep mutational scan (DMS), experimental single- and double-mutant characterization, and neural networks to engineer hyperactive multimutant proteins. The authors attribute the framework's performance to "epistasis-aware modeling" and claim that their neural networks "learn the epistatic landscape" and "identify synergistic interactions" from limited double-mutant training data. Here we show that MULTI-evolve's multimutant predictions are almost perfectly correlated with those of an additive model across all three engineering applications (APEX, dCasRx, and HuABC2), such that the engineering of multimutants reduces to combining the beneficial mutations with the largest additive effects, a strategy that has been standard in protein engineering for over four decades. We also find that MULTI-evolve's neural networks do not outperform an additive model in held-out test set predictions. Finally, we revisit a DMS benchmark finding presented as evidence of epistasis learning and show that it is expected even under a null additive model due to an elementary statistical phenomenon. Indeed, when we fit an additive model to the benchmark data, it reproduces the pattern purported to demonstrate epistasis learning.
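
To make the term "additive model" concrete, the following is a minimal sketch of one common formulation, not the authors' code: each variant's score is modeled as an intercept plus a sum of per-mutation effects, fit by ordinary least squares. The function names, mutation labels, and scores are hypothetical and chosen only for illustration.

```python
import numpy as np

def fit_additive(variants, scores, mutations):
    """Least-squares fit of: score = intercept + sum of per-mutation effects."""
    index = {m: j for j, m in enumerate(mutations)}
    # Design matrix: column 0 is the intercept, then one indicator per mutation.
    X = np.zeros((len(variants), len(mutations) + 1))
    X[:, 0] = 1.0
    for i, variant in enumerate(variants):
        for m in variant:
            X[i, index[m] + 1] = 1.0
    coef, *_ = np.linalg.lstsq(X, np.asarray(scores, dtype=float), rcond=None)
    return coef[0], dict(zip(mutations, coef[1:]))

def predict_additive(intercept, effects, variant):
    """Additive prediction: no interaction (epistasis) terms of any kind."""
    return intercept + sum(effects[m] for m in variant)

# Hypothetical toy data: wild type, three single mutants, one double mutant.
mutations = ["A1V", "G2S", "L3F"]
variants = [(), ("A1V",), ("G2S",), ("L3F",), ("A1V", "G2S")]
scores = [1.0, 1.5, 1.2, 0.9, 1.7]

intercept, effects = fit_additive(variants, scores, mutations)
triple = predict_additive(intercept, effects, ("A1V", "G2S", "L3F"))
```

Under such a model, the top-ranked multimutant is simply the combination of mutations with the largest fitted effects, which is the baseline against which any claim of learned epistasis would need to be compared.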

Authors

  • Visani, G. M.
  • Verma, A.
  • DeWitt, W. S.

Categories