Nucleic acids research - AI Medical Compendium Journals

m6ACali: machine learning-powered calibration for accurate m6A detection in MeRIP-Seq.

Nucleic acids research May 22, 2024

We present m6ACali, a novel machine-learning framework aimed at enhancing the accuracy of N6-methyladenosine (m6A) epitranscriptome profiling by reducing the impact of non-specific antibody enrichment in MeRIP-Seq. The calibration model serves as a g...

Transcriptome RNA, Messenger Humans Machine Learning Calibration Adenosine Sequence Analysis, RNA Methylation

View on PubMed DOI

Prediction of DNA i-motifs via machine learning.

Nucleic acids research Mar 21, 2024

i-Motifs (iMs), are secondary structures formed in cytosine-rich DNA sequences and are involved in multiple functions in the genome. Although putative iM forming sequences are widely distributed in the human genome, the folding status and strength of...

Machine Learning DNA Humans Base Sequence Cytosine Nucleotide Motifs

View on PubMed DOI

EquiPNAS: improved protein-nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks.

Nucleic acids research Mar 21, 2024

Protein language models (pLMs) trained on a large corpus of protein sequences have shown unprecedented scalability and broad generalizability in a wide range of predictive modeling tasks, but their power has not yet been harnessed for predicting prot...

Proteins Amino Acid Sequence Nucleic Acids Binding Sites Neural Networks, Computer

View on PubMed DOI

Language model-based B cell receptor sequence embeddings can effectively encode receptor specificity.

Nucleic acids research Jan 25, 2024

High throughput sequencing of B cell receptors (BCRs) is increasingly applied to study the immense diversity of antibodies. Learning biologically meaningful embeddings of BCR sequences is beneficial for predictive modeling. Several embedding methods ...

Immunoglobulins Receptors, Antigen, B-Cell Natural Language Processing Humans Amino Acid Sequence High-Throughput Nucleotide Sequencing

View on PubMed DOI

Multiple sequence alignment-based RNA language model and its application to structural inference.

Nucleic acids research Jan 11, 2024

Compared with proteins, DNA and RNA are more difficult languages to interpret because four-letter coded DNA/RNA sequences have less information content than 20-letter coded protein sequences. While BERT (Bidirectional Encoder Representations from Tra...

RNA Solvents Proteins DNA Sequence Alignment Machine Learning

View on PubMed DOI