AIMC Topic: Molecular Sequence Data

Clear Filters Showing 11 to 20 of 32 articles

Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC.

Computer methods and programs in biomedicine
Heat Shock Proteins (HSPs) are the substantial ingredients for cell growth and viability, which are found in all living organisms. HSPs manage the process of folding and unfolding of proteins, the quality of newly synthesized proteins and protecting ...

PredSTP: a highly accurate SVM based model to predict sequential cystine stabilized peptides.

BMC bioinformatics
BACKGROUND: Numerous organisms have evolved a wide range of toxic peptides for self-defense and predation. Their effective interstitial and macro-environmental use requires energetic and structural stability. One successful group of these peptides in...

TRAL: tandem repeat annotation library.

Bioinformatics (Oxford, England)
MOTIVATION: Currently, more than 40 sequence tandem repeat detectors are published, providing heterogeneous, partly complementary, partly conflicting results.

Machine learning assisted design of highly active peptides for drug discovery.

PLoS computational biology
The discovery of peptides possessing high biological activity is very challenging due to the enormous diversity for which only a minority have the desired properties. To lower cost and reduce the time to obtain promising peptides, machine learning ap...

Accurate in silico identification of protein succinylation sites using an iterative semi-supervised learning technique.

Journal of theoretical biology
As a widespread type of protein post-translational modifications (PTMs), succinylation plays an important role in regulating protein conformation, function and physicochemical properties. Compared with the labor-intensive and time-consuming experimen...

Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.

BMC systems biology
BACKGROUND: DNA-binding proteins play a pivotal role in various intra- and extra-cellular activities ranging from DNA replication to gene expression control. Identification of DNA-binding proteins is one of the major challenges in the field of genome...

An improved poly(A) motifs recognition method based on decision level fusion.

Computational biology and chemistry
Polyadenylation is the process of addition of poly(A) tail to mRNA 3' ends. Identification of motifs controlling polyadenylation plays an essential role in improving genome annotation accuracy and better understanding of the mechanisms governing gene...

Using support vector machines to identify protein phosphorylation sites in viruses.

Journal of molecular graphics & modelling
Phosphorylation of viral proteins plays important roles in enhancing replication and inhibition of normal host-cell functions. Given its importance in biology, a unique opportunity has arisen to identify viral protein phosphorylation sites. However, ...

CAMELOT: A machine learning approach for coarse-grained simulations of aggregation of block-copolymeric protein sequences.

The Journal of chemical physics
We report the development and deployment of a coarse-graining method that is well suited for computer simulations of aggregation and phase separation of protein sequences with block-copolymeric architectures. Our algorithm, named CAMELOT for Coarse-g...