TemStaPro: protein thermostability prediction using sequence representations from protein language models.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Reliable prediction of protein thermostability from its sequence is valuable for both academic and industrial research. This prediction problem can be tackled using machine learning and by taking advantage of the recent blossoming of deep learning methods for sequence analysis. These methods can facilitate training on more data and, possibly, enable the development of more versatile thermostability predictors for multiple ranges of temperatures.

Authors

  • Ieva Pudžiuvelytė
    Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania.
  • Kliment Olechnovič
    Institute of Biotechnology, Life Sciences Center, Vilnius University, Vilnius, Lithuania.
  • Egle Godliauskaite
    CasZyme, LT-10257 Vilnius, Lithuania.
  • Kristupas Sermokas
    CasZyme, LT-10257 Vilnius, Lithuania.
  • Tomas Urbaitis
    CasZyme, LT-10257 Vilnius, Lithuania.
  • Giedrius Gasiunas
    Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania.
  • Darius Kazlauskas
    Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania.