Universal probabilistic programming offers a powerful approach to statistical phylogenetics.

Journal: Communications biology
PMID:

Abstract

Statistical phylogenetic analysis currently relies on complex, dedicated software packages, making it difficult for evolutionary biologists to explore new models and inference strategies. Recent years have seen more generic solutions based on probabilistic graphical models, but this formalism can only partly express phylogenetic problems. Here, we show that universal probabilistic programming languages (PPLs) solve the expressivity problem, while still supporting automated generation of efficient inference algorithms. To prove the latter point, we develop automated generation of sequential Monte Carlo (SMC) algorithms for PPL descriptions of arbitrary biological diversification (birth-death) models. SMC is a new inference strategy for these problems, supporting both parameter inference and efficient estimation of Bayes factors that are used in model testing. We take advantage of this in automatically generating SMC algorithms for several recent diversification models that have been difficult or impossible to tackle previously. Finally, applying these algorithms to 40 bird phylogenies, we show that models with slowing diversification, constant turnover and many small shifts generally explain the data best. Our work opens up several related problem domains to PPL approaches, and shows that few hurdles remain before these techniques can be effectively applied to the full range of phylogenetic models.

Authors

  • Fredrik Ronquist
    Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Frescativagen 40, 114 18 Stockholm, Sweden.
  • Jan Kudlicka
    Department of Information Technology, Uppsala University, Uppsala, Sweden.
  • Viktor Senderov
    Pensoft Publishers, Prof. Georgi Zlatarski 12, Sofia, 1700, Bulgaria. vsenderov@gmail.com.
  • Johannes Borgström
    Department of Information Technology, Uppsala University, Uppsala, Sweden.
  • Nicolas Lartillot
    Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, Villeurbanne, France.
  • Daniel Lundén
    Department of Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.
  • Lawrence Murray
    Uber AI, San Francisco, CA, USA.
  • Thomas B Schön
    Division of Systems and Control, Department of Information Technology (T.B.S.), Uppsala University, Sweden.
  • David Broman
    Department of Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden.