AIMC Topic: Genome

Clear Filters Showing 11 to 20 of 175 articles

Genome-scale metabolic models in translational medicine: the current status and potential of machine learning in improving the effectiveness of the models.

Molecular omics
The genome-scale metabolic model (GEM) has emerged as one of the leading modeling approaches for systems-level metabolic studies and has been widely explored for a broad range of organisms and applications. Owing to the development of genome sequenci...

DeepReg: a deep learning hybrid model for predicting transcription factors in eukaryotic and prokaryotic genomes.

Scientific reports
Deep learning models (DLMs) have gained importance in predicting, detecting, translating, and classifying a diversity of inputs. In bioinformatics, DLMs have been used to predict protein structures, transcription factor-binding sites, and promoters. ...

KEGG orthology prediction of bacterial proteins using natural language processing.

BMC bioinformatics
BACKGROUND: The advent of high-throughput technologies has led to an exponential increase in uncharacterized bacterial protein sequences, surpassing the capacity of manual curation. A large number of bacterial protein sequences remain unannotated by ...

Accurate top protein variant discovery via low-N pick-and-validate machine learning.

Cell systems
A strategy to obtain the greatest number of best-performing variants with least amount of experimental effort over the vast combinatorial mutational landscape would have enormous utility in boosting resource producibility for protein engineering. Tow...

Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data.

BMC genomics
BACKGROUND: The accurate prediction of genomic breeding values is central to genomic selection in both plant and animal breeding studies. Genomic prediction involves the use of thousands of molecular markers spanning the entire genome and therefore r...

Multi-label classification with XGBoost for metabolic pathway prediction.

BMC bioinformatics
BACKGROUND: Metabolic pathway prediction is one possible approach to address the problem in system biology of reconstructing an organism's metabolic network from its genome sequence. Recently there have been developments in machine learning-based pat...

Personal transcriptome variation is poorly explained by current genomic deep learning models.

Nature genetics
Genomic deep learning models can predict genome-wide epigenetic features and gene expression levels directly from DNA sequence. While current models perform well at predicting gene expression levels across genes in different cell types from the refer...

Functional annotation of enzyme-encoding genes using deep learning with transformer layers.

Nature communications
Functional annotation of open reading frames in microbial genomes remains substantially incomplete. Enzymes constitute the most prevalent functional gene class in microbial genomes and can be described by their specific catalytic functions using the ...

disperseNN2: a neural network for estimating dispersal distance from georeferenced polymorphism data.

BMC bioinformatics
Spatial genetic variation is shaped in part by an organism's dispersal ability. We present a deep learning tool, disperseNN2, for estimating the mean per-generation dispersal distance from georeferenced polymorphism data. Our neural network performs ...

Harnessing deep learning for population genetic inference.

Nature reviews. Genetics
In population genetics, the emergence of large-scale genomic data for various species and populations has provided new opportunities to understand the evolutionary forces that drive genetic diversity using statistical inference. However, the era of p...