AIMC Topic: Genome, Bacterial

Clear Filters Showing 61 to 70 of 92 articles

Pre_GI: a global map of ontological links between horizontally transferred genomic islands in bacterial and archaeal genomes.

Database : the journal of biological databases and curation
The Predicted Genomic Islands database (Pre_GI) is a comprehensive repository of prokaryotic genomic islands (islands, GIs) freely accessible at http://pregi.bi.up.ac.za/index.php. Pre_GI, Version 2015, catalogues 26 744 islands identified in 2407 ba...

Analysis of strand-specific RNA-seq data using machine learning reveals the structures of transcription units in Clostridium thermocellum.

Nucleic acids research
Identification of transcription units (TUs) encoded in a bacterial genome is essential to elucidation of transcriptional regulation of the organism. To gain a detailed understanding of the dynamically composed TU structures, we have used four strand-...

EcoliNet: a database of cofunctional gene network for Escherichia coli.

Database : the journal of biological databases and curation
During the past several decades, Escherichia coli has been a treasure chest for molecular biology. The molecular mechanisms of many fundamental cellular processes have been discovered through research on this bacterium. Although much basic research n...

Genomic language models (gLMs) decode bacterial genomes for improved gene prediction and translation initiation site identification.

Briefings in bioinformatics
Accurate bacterial gene prediction is essential for understanding microbial functions and advancing biotechnology. Traditional methods based on sequence homology and statistical models often struggle with complex genetic variations and novel sequence...

Genome Mining and Chemistry-Driven Discovery of a Cell Wall Lipopeptide Signature for subsp. Ancestral Lineage.

ACS infectious diseases
subsp. () causes Johne's disease (JD), a chronic infection responsible for considerable economic losses to dairy industries worldwide. Genetically clonal, has evolved into three distinct genetic lineages designated CII, for bovine strains, and SI ...

Deciphering the biosynthetic potential of microbial genomes using a BGC language processing neural network model.

Nucleic acids research
Biosynthetic gene clusters (BGCs), key in synthesizing microbial secondary metabolites, are mostly hidden in microbial genomes and metagenomes. To unearth this vast potential, we present BGC-Prophet, a transformer-based language model for BGC predict...

Negative dataset selection impacts machine learning-based predictors for multiple bacterial species promoters.

Bioinformatics (Oxford, England)
MOTIVATION: Advances in bacterial promoter predictors based on machine learning have greatly improved identification metrics. However, existing models overlooked the impact of negative datasets, previously identified in GC-content discrepancies betwe...

Predicting the bacterial host range of plasmid genomes using the language model-based one-class support vector machine algorithm.

Microbial genomics
The prediction of the plasmid host range is crucial for investigating the dissemination of plasmids and the transfer of resistance and virulence genes mediated by plasmids. Several machine learning-based tools have been developed to predict plasmid h...

Using core genome and machine learning for serovar prediction in Salmonella enterica subspecies I strains.

FEMS microbiology letters
This study presents a dual investigation of Salmonella enterica subspecies I, focusing on serovar prediction and core genome characteristics. We utilized two large genomic datasets (panX and NCBI Pathogen Detection) to test machine learning methods f...

PanKB: An interactive microbial pangenome knowledgebase for research, biotechnological innovation, and knowledge mining.

Nucleic acids research
The exponential growth of microbial genome data presents unprecedented opportunities for unlocking the potential of microorganisms. The burgeoning field of pangenomics offers a framework for extracting insights from this big biological data. Recent a...