Predicting protein condensate formation using machine learning.

Journal: Cell reports
Published Date:

Abstract

Membraneless organelles are liquid condensates, which form through liquid-liquid phase separation. Recent advances show that phase separation is essential for cellular homeostasis by regulating basic cellular processes, including transcription and signal transduction. The reported number of proteins with the capacity to mediate protein phase separation (PPS) is continuously growing. While computational tools for predicting PPS have been developed, obtaining a proteome-wide overview of PPS probabilities has remained challenging. Here, we present a phase separation analysis and prediction (PSAP) machine-learning classifier that, based solely on the amino acid content of a training set of known PPS proteins, can determine the phase separation likelihood for each protein in a given proteome. Through comparison with PPS databases, existing predictors, and experimental evidence, we demonstrate the validity and advantages of the PSAP classifier. We anticipate that the PSAP predictor provides a useful tool for future research aimed at identifying phase separating proteins in health and disease.

Authors

  • Guido van Mierlo
    Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, 6525 GA Nijmegen, the Netherlands. Electronic address: g.vanmierlo@science.ru.nl.
  • Jurriaan R G Jansen
    Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, 6525 GA Nijmegen, the Netherlands.
  • Jie Wang
  • Ina Poser
    Max Planck Institute (MPI) for Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Dewpoint Therapeutics GmbH, c/o Max Planck Institute (MPI) for Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany.
  • Simon J van Heeringen
    Department of Molecular Developmental Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Radboud University Nijmegen, 6500 HB Nijmegen, the Netherlands. Electronic address: s.vanheeringen@science.ru.nl.
  • Michiel Vermeulen
    Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Geert Grooteplein Zuid 28, 6525GA, Nijmegen, The Netherlands.