AIMC Topic: Molecular Sequence Annotation

Clear Filters Showing 51 to 60 of 260 articles

DDBJ Data Analysis Challenge: a machine learning competition to predict Arabidopsis chromatin feature annotations from DNA sequences.

Genes & genetic systems
Recently, the prospect of applying machine learning tools for automating the process of annotation analysis of large-scale sequences from next-generation sequencers has raised the interest of researchers. However, finding research collaborators with ...

Genome-wide inference of the Camponotus floridanus protein-protein interaction network using homologous mapping and interacting domain profile pairs.

Scientific reports
Apart from some model organisms, the interactome of most organisms is largely unidentified. High-throughput experimental techniques to determine protein-protein interactions (PPIs) are resource intensive and highly susceptible to noise. Computational...

Combined Use of Three Machine Learning Modeling Methods to Develop a Ten-Gene Signature for the Diagnosis of Ventilator-Associated Pneumonia.

Medical science monitor : international medical journal of experimental and clinical research
BACKGROUND This study aimed to use three modeling methods, logistic regression analysis, random forest analysis, and fully-connected neural network analysis, to develop a diagnostic gene signature for the diagnosis of ventilator-associated pneumonia ...

Discovery and annotation of novel microRNAs in the porcine genome by using a semi-supervised transductive learning approach.

Genomics
Despite the broad variety of available microRNA (miRNA) prediction tools, their application to the discovery and annotation of novel miRNA genes in domestic species is still limited. In this study we designed a comprehensive pipeline (eMIRNA) for miR...

Identifying pseudoenzymes using functional annotation: pitfalls of common practice.

The FEBS journal
Pseudoenzymes are proteins that are evolutionary related to enzymes but lack relevant catalytic activity. They are usually evolved from enzymatic ancestors that have lost their catalytic activities. The loss of catalytic function is one extreme among...

Machine Learning Approaches for the Prioritization of Genomic Variants Impacting Pre-mRNA Splicing.

Cells
Defects in pre-mRNA splicing are frequently a cause of Mendelian disease. Despite the advent of next-generation sequencing, allowing a deeper insight into a patient's variant landscape, the ability to characterize variants causing splicing defects ha...

Learning a functional grammar of protein domains using natural language word embedding techniques.

Proteins
In this paper, using Word2vec, a widely-used natural language processing method, we demonstrate that protein domains may have a learnable implicit semantic "meaning" in the context of their functional contributions to the multi-domain proteins in whi...

Artificial intelligence in clinical and genomic diagnostics.

Genome medicine
Artificial intelligence (AI) is the development of computer systems that are able to perform tasks that normally require human intelligence. Advances in AI software and hardware, especially deep learning algorithms and the graphics processing units (...

Machine learning for discovering missing or wrong protein function annotations : A comparison using updated benchmark datasets.

BMC bioinformatics
BACKGROUND: A massive amount of proteomic data is generated on a daily basis, nonetheless annotating all sequences is costly and often unfeasible. As a countermeasure, machine learning methods have been used to automatically annotate new protein func...

Advances in gene ontology utilization improve statistical power of annotation enrichment.

PloS one
Gene-annotation enrichment is a common method for utilizing ontology-based annotations in gene and gene-product centric knowledgebases. Effective utilization of these annotations requires inferring semantic linkages by tracing paths through edges in ...