AIMC Topic: Sequence Alignment

Clear Filters Showing 1 to 10 of 153 articles

High-accuracy protein complex structure modeling based on sequence-derived structure complementarity.

Nature communications
In living organisms, proteins perform key functions required for life activities by interacting to form complexes. Determining the protein complex structure is crucial for understanding and mastering biological functions. Although AlphaFold2 makes a ...

Coevolutionary signals in multiple sequence alignments improve virulence factor prediction with an MSA Transformer.

Scientific reports
Identification of virulence factors (VFs) is critical for expanding our knowledge on bacterial pathogenesis and also for developing targeted strategies for the prevention and treatment of related infectious diseases. Understanding virulence factors r...

OneProt: Towards multi-modal protein foundation models via latent space alignment of sequence, structure, binding sites and text encoders.

PLoS computational biology
Recent advances in Artificial Intelligence have enabled multi-modal systems to model and translate diverse information spaces. Extending beyond text and vision, we introduce OneProt, a multi-modal Deep Learning model for proteins that integrates stru...

Predicting protein-protein interactions in the human proteome.

Science (New York, N.Y.)
Protein-protein interactions (PPIs) are essential for biological function. Coevolutionary analysis and deep-learning (DL)-based protein structure prediction have enabled comprehensive PPI identification in bacteria and yeast, but these approaches hav...

SARST2 high-throughput and resource-efficient protein structure alignment against massive databases.

Nature communications
The flood of protein structural Big Data is coming. With the belief that biotech researchers deserve powerful analysis engines to overcome the challenge of rapidly increasing computational demands, we are devoted to developing efficient protein struc...

TEtrimmer: a tool to automate the manual curation of transposable elements.

Nature communications
Transposable elements (TEs) are repetitive DNA sequences that move within genomes and play important roles in gene regulation and genome evolution. Accurate TE annotation in genomesĀ is crucial for downstream analyses but challenging due to their sequ...

Partner-RBR: Predicting Multitype RNA-Binding Residues Based on Mutual Learning.

Journal of chemical information and modeling
RNA molecules play diverse and critical roles in various biological processes, including gene expression, post-transcriptional regulation, and disease pathogenesis. Understanding the interaction between proteins and RNA necessitates the precise ident...

In silico design of smaller size enzymatic protein by generative artificial intelligence (ProtGPT2).

Journal of bioscience and bioengineering
The construction of small proteins by removing amino acid subsequences that are not involved in function, activity, or structure is crucial for bioprocessing and drug development. Traditional design methods often focus on reconstructing functional mo...

AMPGen: an evolutionary information-reserved and diffusion-driven generative model for de novo design of antimicrobial peptides.

Communications biology
The rapid advancement of artificial intelligence (AI) has enabled de novo design of functional proteins, circumventing the reliance on natural templates or sequencing databases. However, current protein design models are ineffective in generating pro...

ProFun-SOM: Protein Function Prediction for Specific Ontology Based on Multiple Sequence Alignment Reconstruction.

IEEE transactions on neural networks and learning systems
Protein function prediction is crucial for understanding species evolution, including viral mutations. Gene ontology (GO) is a standardized representation framework for describing protein functions with annotated terms. Each ontology is a specific fu...