EMQIT: a machine learning approach for energy based PWM matrix quality improvement.

Journal: Biology direct
Published Date:

Abstract

BACKGROUND: Transcription factor binding affinities to DNA play a key role for the gene regulation. Learning the specificity of the mechanisms of binding TFs to DNA is important both to experimentalists and theoreticians. With the development of high-throughput methods such as, e.g., ChiP-seq the need to provide unbiased models of binding events has been made apparent. We present EMQIT a modification to the approach introduced by Alamanova et al. and later implemented as 3DTF server. We observed that tuning of Boltzmann factor weights, used for conversion of calculated energies to nucleotide probabilities, has a significant impact on the quality of the associated PWM matrix.

Authors

  • Karolina Smolinska
    Institute of Automatic Control, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland.
  • Marcin Pacholczyk
    Institute of Automatic Control, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland. marcin.pacholczyk@polsl.pl.