adabmDCA: adaptive Boltzmann machine learning for biological sequences.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Boltzmann machines are energy-based models that have been shown to provide an accurate statistical description of domains of evolutionary-related protein and RNA families. They are parametrized in terms of local biases accounting for residue conservation, and pairwise terms to model epistatic coevolution between residues. From the model parameters, it is possible to extract an accurate prediction of the three-dimensional contact map of the target domain. More recently, the accuracy of these models has been also assessed in terms of their ability in predicting mutational effects and generating in silico functional sequences.

Authors

  • Anna Paola Muntoni
    Statistical Inference and Biological Modeling Group, Italian Institute for Genomic Medicine, Candiolo, Italy. anna.muntoni@polito.it.
  • Andrea Pagnani
    Statistical Inference and Biological Modeling Group, Italian Institute for Genomic Medicine, Candiolo, Italy.
  • Martin Weigt
    Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, CNRS, Sorbonne Université, Paris, France.
  • Francesco Zamponi
    Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, CNRS, Université PSL, Sorbonne Université, Université de Paris, Paris, France.