AIMC Topic: Sequence Alignment

Showing 71 to 80 of 153 articles

Distance-based protein folding powered by deep learning.

Proceedings of the National Academy of Sciences of the United States of America
Direct coupling analysis (DCA) for protein folding has made very good progress, but it is not effective for proteins that lack many sequence homologs, even coupled with time-consuming conformation sampling with fragments. We show that we can accurate...

Learning Compositional Representations of Interacting Systems with Restricted Boltzmann Machines: Comparative Study of Lattice Proteins.

Neural computation
A restricted Boltzmann machine (RBM) is an unsupervised machine learning bipartite graphical model that jointly learns a probability distribution over data and extracts their relevant statistical features. RBMs were recently proposed for characterizi...

ProteinNet: a standardized data set for machine learning of protein structure.

BMC bioinformatics
BACKGROUND: Rapid progress in deep learning has spurred its application to bioinformatics problems including protein structure prediction and design. In classic machine learning problems like computer vision, progress has been driven by standardized ...

Deep convolutional neural networks for accurate somatic mutation detection.

Nature communications
Accurate detection of somatic mutations is still a challenge in cancer analysis. Here we present NeuSomatic, the first convolutional neural network approach for somatic mutation detection, which significantly outperforms previous methods on different...

Discerning novel splice junctions derived from RNA-seq alignment: a deep learning approach.

BMC genomics
BACKGROUND: Exon splicing is a regulated cellular process in the transcription of protein-coding genes. Technological advancements and cost reductions in RNA sequencing have made quantitative and qualitative assessments of the transcriptome both poss...

Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks.

Cell systems
While genes are defined by sequence, in biological systems a protein's function is largely determined by its three-dimensional structure. Evolutionary information embedded within multiple sequence alignments provides a rich source of data for inferri...

Classification of G-protein coupled receptors based on a rich generation of convolutional neural network, N-gram transformation and multiple sequence alignments.

Amino acids
Sequence classification is crucial in predicting the function of newly discovered sequences. In recent years, the prediction of the incremental large-scale and diversity of sequences has heavily relied on the involvement of machine-learning algorithm...

Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.

Proteins
We develop two complementary pipelines, "Zhang-Server" and "QUARK", based on I-TASSER and QUARK pipelines for template-based modeling (TBM) and free modeling (FM), and test them in the CASP12 experiment. The combination of I-TASSER and QUARK successf...

Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.

Proteins
In this study, we report the evaluation of the residue-residue contacts predicted by our three different methods in the CASP12 experiment, focusing on studying the impact of multiple sequence alignment, residue coevolution, and machine learning on co...