AIMC Topic: Sequence Alignment

Clear Filters Showing 91 to 100 of 153 articles

Prediction Enhancement of Residue Real-Value Relative Accessible Surface Area in Transmembrane Helical Proteins by Solving the Output Preference Problem of Machine Learning-Based Predictors.

Journal of chemical information and modeling
The α-helical transmembrane proteins constitute 25% of the entire human proteome space and are difficult targets in high-resolution wet-lab structural studies, calling for accurate computational predictors. We present a novel sequence-based method ca...

Survey of Natural Language Processing Techniques in Bioinformatics.

Computational and mathematical methods in medicine
Informatics methods, such as text mining and natural language processing, are always involved in bioinformatics research. In this study, we discuss text mining and natural language processing methods in bioinformatics from two perspectives. First, we...

SAMSVM: A tool for misalignment filtration of SAM-format sequences with support vector machine.

Journal of bioinformatics and computational biology
Sequence alignment/map (SAM) formatted sequences [Li H, Handsaker B, Wysoker A et al., Bioinformatics 25(16):2078-2079, 2009.] have taken on a main role in bioinformatics since the development of massive parallel sequencing. However, because misalign...

Accurate contact predictions using covariation techniques and machine learning.

Proteins
Here we present the results of residue-residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effectiv...

Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning.

Bioinformatics (Oxford, England)
MOTIVATION: Protein contact prediction is important for protein structure and functional study. Both evolutionary coupling (EC) analysis and supervised machine learning methods have been developed, making use of different information sources. However...

GoFDR: A sequence alignment based method for predicting protein functions.

Methods (San Diego, Calif.)
In this study, we developed a method named GoFDR for predicting Gene Ontology (GO)-based protein functions. The input for GoFDR is simply a query sequence-based multiple sequence alignment (MSA) produced by PSI-BLAST. For each GO term annotated to th...

Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC.

Computer methods and programs in biomedicine
Heat Shock Proteins (HSPs) are the substantial ingredients for cell growth and viability, which are found in all living organisms. HSPs manage the process of folding and unfolding of proteins, the quality of newly synthesized proteins and protecting ...

Genome Modeling System: A Knowledge Management Platform for Genomics.

PLoS computational biology
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled...

miRBoost: boosting support vector machines for microRNA precursor classification.

RNA (New York, N.Y.)
Identification of microRNAs (miRNAs) is an important step toward understanding post-transcriptional gene regulation and miRNA-related pathology. Difficulties in identifying miRNAs through experimental techniques combined with the huge amount of data ...

A novel representation of genomic sequences for taxonomic clustering and visualization by means of self-organizing maps.

Bioinformatics (Oxford, England)
MOTIVATION: Self-organizing maps (SOMs) are readily available bioinformatics methods for clustering and visualizing high-dimensional data, provided that such biological information is previously transformed to fixed-size, metric-based vectors. To inc...