The proteome landscape of the kingdoms of life.

Journal: Nature
PMID:

Abstract

Proteins carry out the vast majority of functions in all biological domains, but for technological reasons their large-scale investigation has lagged behind the study of genomes. Since the first essentially complete eukaryotic proteome was reported, advances in mass-spectrometry-based proteomics have enabled increasingly comprehensive identification and quantification of the human proteome. However, there have been few comparisons across species, in stark contrast with genomics initiatives. Here we use an advanced proteomics workflow-in which the peptide separation step is performed by a microstructured and extremely reproducible chromatographic system-for the in-depth study of 100 taxonomically diverse organisms. With two million peptide and 340,000 stringent protein identifications obtained in a standardized manner, we double the number of proteins with solid experimental evidence known to the scientific community. The data also provide a large-scale case study for sequence-based machine learning, as we demonstrate by experimentally confirming the predicted properties of peptides from Bacteroides uniformis. Our results offer a comparative view of the functional organization of organisms across the entire evolutionary range. A remarkably high fraction of the total proteome mass in all kingdoms is dedicated to protein homeostasis and folding, highlighting the biological challenge of maintaining protein structure in all branches of life. Likewise, a universally high fraction is involved in supplying energy resources, although these pathways range from photosynthesis through iron sulfur metabolism to carbohydrate metabolism. Generally, however, proteins and proteomes are remarkably diverse between organisms, and they can readily be explored and functionally compared at www.proteomesoflife.org.

Authors

  • Johannes B Müller
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Philipp E Geyer
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Ana R Colaço
    NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark.
  • Peter V Treit
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Maximilian T Strauss
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Mario Oroshi
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Sophia Doll
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Sebastian Virreira Winter
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Jakob M Bader
    Department of Proteomics and Signal Transduction, Max Planck Institute of Biochemistry, Martinsried, Germany.
  • Niklas Köhler
    Clinical Infectious Diseases, Research Center Borstel, Borstel, Germany.
  • Fabian Theis
    Helmholtz Zentrum München-German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Munich, Germany.
  • Alberto Santos
    Disease Systems Biology, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, 2200, Denmark.
  • Matthias Mann
    From the ‡Proteomics and Signal Transduction Group and mmann@biochem.mpg.de.