AIMC Topic: Databases, Protein

Clear Filters Showing 1 to 10 of 727 articles

MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models.

BMC biology
BACKGROUND: Protein-protein interactions (PPIs) play a critical role in essential biological processes such as signal transduction, enzyme activity regulation, cytoskeletal structure, immune responses, and gene regulation. However, current methods ma...

Bag-of-words is competitive with sum-of-embeddings language-inspired representations on protein inference.

PloS one
Inferring protein function is a fundamental and long-standing problem in biology. Laboratory experiments in this field are often expensive, and therefore large-scale computational protein inference from readily available amino acid sequences is neede...

Rprot-Vec: a deep learning approach for fast protein structure similarity calculation.

BMC bioinformatics
BACKGROUND: Predicting protein structural similarity and detecting homologous sequences remain fundamental and challenging tasks in computational biology. Accurate identification of structural homologs enables function inference for newly discovered ...

Comprehensive protein datasets and benchmarking for liquid-liquid phase separation studies.

Genome biology
BACKGROUND: Proteins self-organize in dynamic cellular environments by assembling into reversible biomolecular condensates through liquid-liquid phase separation (LLPS). These condensates can comprise single or multiple proteins, with different roles...

Advancing the accuracy of clathrin protein prediction through multi-source protein language models.

Scientific reports
Clathrin is a key cytoplasmic protein that serves as the predominant structural element in the formation of coated vesicles. Specifically, clarithin enables the scission of newly formed vesicles from the plasma membrane's cytoplasmic face. Efficient ...

Ultradeep N-glycoproteome atlas of mouse reveals spatiotemporal signatures of brain aging and neurodegenerative diseases.

Nature communications
The current depth of site-specific N-glycoproteomics is insufficient to fully characterize glycosylation events in biological samples. Herein, we achieve an ultradeep and precision analysis of the N-glycoproteome of mouse tissues by integrating multi...

A fast (CNN + MCWS-transformer) based architecture for protein function prediction.

Statistical applications in genetics and molecular biology
The transformer model for sequence mining has brought a paradigmatic shift to many domains, including biological sequence mining. However, transformers suffer from quadratic complexity, i.e., O( ), where is the sequence length, which affects the tra...

M-DeepAssembly: enhanced DeepAssembly based on multi-objective multi-domain protein conformation sampling.

BMC bioinformatics
BACKGROUND: Association and cooperation among structural domains play an important role in protein function and drug design. Despite remarkable advancements in highly accurate single-domain protein structure prediction through the collaborative effor...

Accurate identification and mechanistic evaluation of pathogenic missense variants with .

Proceedings of the National Academy of Sciences of the United States of America
Understanding the effects of missense mutations or single amino acid variants (SAVs) on protein function is crucial for elucidating the molecular basis of diseases/disorders and designing rational therapies. We introduce here , a machine learning too...

ProFun-SOM: Protein Function Prediction for Specific Ontology Based on Multiple Sequence Alignment Reconstruction.

IEEE transactions on neural networks and learning systems
Protein function prediction is crucial for understanding species evolution, including viral mutations. Gene ontology (GO) is a standardized representation framework for describing protein functions with annotated terms. Each ontology is a specific fu...