AIMC Topic: Molecular Sequence Annotation

Clear Filters Showing 11 to 20 of 255 articles

Protein function annotation and virulence factor identification of Klebsiella pneumoniae genome by multiple machine learning models.

Microbial pathogenesis
Klebsiella pneumoniae is a type of Gram-negative bacterium which can cause a range of infections in human. In recent years, an increasing number of strains of K. pneumoniae resistant to multiple antibiotics have emerged, posing a significant threat t...

Vocabulary Matters: An Annotation Pipeline and Four Deep Learning Algorithms for Enzyme Named Entity Recognition.

Journal of proteome research
Enzymes are indispensable in many biological processes, and with biomedical literature growing exponentially, effective literature review becomes increasingly challenging. Natural language processing methods offer solutions to streamline this process...

KEGG orthology prediction of bacterial proteins using natural language processing.

BMC bioinformatics
BACKGROUND: The advent of high-throughput technologies has led to an exponential increase in uncharacterized bacterial protein sequences, surpassing the capacity of manual curation. A large number of bacterial protein sequences remain unannotated by ...

AnnoPRO: a strategy for protein function annotation based on multi-scale protein representation and a hybrid deep learning of dual-path encoding.

Genome biology
Protein function annotation has been one of the longstanding issues in biological sciences, and various computational methods have been developed. However, the existing methods suffer from a serious long-tail problem, with a large number of GO famili...

Functional annotation of enzyme-encoding genes using deep learning with transformer layers.

Nature communications
Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the ...

Uncovering new families and folds in the natural protein universe.

Nature
We are now entering a new era in protein sequence and structure annotation, with hundreds of millions of predicted protein structures made available through the AlphaFold database. These models cover nearly all proteins that are known, including thos...

Enzyme function prediction using contrastive learning.

Science (New York, N.Y.)
Enzyme function annotation is a fundamental challenge, and numerous computational tools have been developed. However, most of these tools cannot accurately predict functional annotations, such as enzyme commission (EC) number, for less-studied protei...

Comprehensive Functional Annotation of Metagenomes and Microbial Genomes Using a Deep Learning-Based Method.

mSystems
Comprehensive protein function annotation is essential for understanding microbiome-related disease mechanisms in the host organisms. However, a large portion of human gut microbial proteins lack functional annotation. Here, we have developed a new m...

ISPRED-SEQ: Deep Neural Networks and Embeddings for Predicting Interaction Sites in Protein Sequences.

Journal of molecular biology
The knowledge of protein-protein interaction sites (PPIs) is crucial for protein functional annotation. Here we address the problem focusing on the prediction of putative PPIs considering as input protein sequences. The issue is important given the h...

DeepPlnc: Bi-modal deep learning for highly accurate plant lncRNA discovery.

Genomics
We present here a bi-modal CNN based deep-learning system, DeepPlnc, to identify plant lncRNAs with high accuracy while using sequence and structural properties. Unlike most of the existing software, it works accurately even in conditions with ambigu...