How Deepbics Quantifies Intensities of Transcription Factor-DNA Binding and Facilitates Prediction of Single Nucleotide Variant Pathogenicity With a Deep Learning Model Trained On ChIP-Seq Data Sets.

Journal: IEEE/ACM transactions on computational biology and bioinformatics

PMID: 35471887

Abstract

The binding of DNA sequences to cell type-specific transcription factors is essential for regulating gene expression in all organisms. Many variants occurring in these binding regions play crucial roles in human disease by disrupting the cis-regulation of gene expression. We first implemented a sequence-based deep learning model called deepBICS to quantify the intensity of transcription factors-DNA binding. The experimental results not only showed the superiority of deepBICS on ChIP-seq data sets but also suggested deepBICS as a language model could help the classification of disease-related and neutral variants. We then built a language model-based method called deepBICS4SNV to predict the pathogenicity of single nucleotide variants. The good performance of deepBICS4SNV on 2 tests related to Mendelian disorders and viral diseases shows the sequence contextual information derived from language models can improve prediction accuracy and generalization capability.

Authors

Lijun Quan

School of Computer Science and Technology, Soochow University, Suzhou 215006, China.
Xiaomin Chu
Xiaoyu Sun

Department of Radiology, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, Nanchang, China.
Tingfang Wu

1 Key Laboratory of Image Information Processing and Intelligent Control of Education Ministry of China, School of Automation, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China.
Qiang Lyu

Department of Computer Science and Technology, Soochow University, Suzhou, Jiangsu, 215006, China.

Keywords

Binding Sites Chromatin Immunoprecipitation Sequencing Deep Learning DNA Humans Nucleotides Transcription Factors Virulence

External Resources

View on PubMed Access via DOI PubMed (35471887)

How Deepbics Quantifies Intensities of Transcription Factor-DNA Binding and Facilitates Prediction of Single Nucleotide Variant Pathogenicity With a Deep Learning Model Trained On ChIP-Seq Data Sets.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals