DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes.

Journal: Scientific reports

PMID: 38644393

Abstract

Deep learning models (DLMs) have gained importance in predicting, detecting, translating, and classifying a diversity of inputs. In bioinformatics, DLMs have been used to predict protein structures, transcription factor-binding sites, and promoters. In this work, we propose a hybrid model to identify transcription factors (TFs) among prokaryotic and eukaryotic protein sequences, named Deep Regulation (DeepReg) model. Two architectures were used in the DL model: a convolutional neural network (CNN), and a bidirectional long-short-term memory (BiLSTM). DeepReg reached a precision of 0.99, a recall of 0.97, and an F1-score of 0.98. The quality of our predictions, the bias-variance trade-off approach, and the characterization of new TF predictions were evaluated and compared against those produced by DeepTFactor, as well as against experimental data from three model organisms. Predictions based on our DLM tended to exhibit less variance and bias than those from DeepTFactor, thus increasing reliability and decreasing overfitting.

Authors

Leonardo Ledesma-Dominguez

Posgrado en Ciencia en Ingeniería de la Computación, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico. leonardoledd@ciencias.unam.mx.
Erik Carbajal-Degante

Posgrado en Ciencia e Ingenieria de la Computación, Universidad Nacional Autónoma de Mexico, Mexico City, Mexico. Electronic address: eydgant@comunidad.unam.mx.
Gabriel Moreno-Hagelsieb

Department of Biology, Wilfrid Laurier University, Waterloo, ON, Canada.
Ernesto Perez-Rueda

Unidad Académica de Yucatán, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Yucatán, Mérida, Mexico.

Keywords

Binding Sites Computational Biology Deep Learning Eukaryota Eukaryotic Cells Genome Neural Networks, Computer Prokaryotic Cells Transcription Factors

External Resources

View on PubMed Access via DOI PubMed (38644393)

DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals