A foundation model for microbial growth dynamics

Journal: bioRxiv
Published Date:

Abstract

Microbial growth dynamics contain rich information about microbial populations, which support applications from antibiotic testing to microbiome engineering. However, the high dimensionality of growth data and the scarcity of large, task-specific datasets have limited generalizable modeling analysis across systems. Here, we develop a foundation model for microbial growth dynamics. It is a large-scale, self-supervised representation model trained on ∼370,000 experimental and simulated growth curves spanning diverse microbial species, environmental conditions, and community contexts. The model learns lower-dimensional latent embeddings that capture essential dynamical features of raw growth data and enable accurate reconstruction of these data. The concise representations enhance predictive performance in diverse downstream applications. Using these embedding, we achieve few-shot learning for antibiotic classification and concentration prediction, accurate forecasting of simulated and experimental communities, and inference of total abundance from relative-abundance data. By extracting transferable representations from heterogeneous datasets, our model provides a general framework for analyzing and predicting microbial community dynamics from limited measurements.

Authors

  • Zachary A. Holmes; Irida Shyti; Alexandra L. Hoffman; Katherine E. Duncker; Helena R. Ma; Zhengqing Zhou; Dongheon Lee; Rohan Maddamsetti; Kyeri Kim; Emrah Şimşek; Grayson S. Hamrick; Hyein Son; César A. Villalobos; Jia Lu; Yuanchi Ha; Ashwini R. Shende; Zhixiang Yao; Sizhe Liu; Daniel M. Shapiro; Kseniia Kholina; Harris Davis; Yasa Baig; Feilun Wu; Shangying Wang; Xiran Wang; Pranam Chatterjee; Michael Lynch; Allison J. Lopatkin; Lawrence David; Emma Chory; Lingchong You