Machine learning applied to enzyme turnover numbers reveals protein structural correlates and improves metabolic models.

Journal: Nature communications
Published Date:

Abstract

Knowing the catalytic turnover numbers of enzymes is essential for understanding the growth rate, proteome composition, and physiology of organisms, but experimental data on enzyme turnover numbers is sparse and noisy. Here, we demonstrate that machine learning can successfully predict catalytic turnover numbers in Escherichia coli based on integrated data on enzyme biochemistry, protein structure, and network context. We identify a diverse set of features that are consistently predictive for both in vivo and in vitro enzyme turnover rates, revealing novel protein structural correlates of catalytic turnover. We use our predictions to parameterize two mechanistic genome-scale modelling frameworks for proteome-limited metabolism, leading to significantly higher accuracy in the prediction of quantitative proteome data than previous approaches. The presented machine learning models thus provide a valuable tool for understanding metabolism and the proteome at the genome scale, and elucidate structural, biochemical, and network properties that underlie enzyme kinetics.

Authors

  • David Heckmann
    Heinrich-Heine-Universität, Institute for Computer Science, 40225 Düsseldorf, Germany. Electronic address: david.heckmann@uni-duesseldorf.de.
  • Colton J Lloyd
    Department of Bioengineering, University of California, San Diego, La Jolla, California, USA.
  • Nathan Mih
    Department of Bioengineering, University of California, San Diego, La Jolla, California, USA.
  • Yuanchi Ha
    Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093-0412, USA.
  • Daniel C Zielinski
    Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093-0412, USA.
  • Zachary B Haiman
    Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093-0412, USA.
  • Abdelmoneim Amer Desouki
    Institute for Computer Science and Department of Biology, Heinrich Heine University, 40225, Düsseldorf, Germany.
  • Martin J Lercher
    Institute for Computer Science and Department of Biology, Heinrich Heine University, 40225, Düsseldorf, Germany.
  • Bernhard O Palsson
    Department of Bioengineering, University of California, San Diego, CA, USA.