AI Medical Compendium Journal:
Genome biology

Showing 61 to 70 of 89 articles

2passtools: two-pass alignment using machine-learning-filtered splice junctions increases the accuracy of intron detection in long-read RNA sequencing.

Genome biology
Transcription of eukaryotic genomes involves complex alternative processing of RNAs. Sequencing of full-length RNAs using long reads reveals the true complexity of processing. However, the relatively high error rates of long-read sequencing technolog...

Pair consensus decoding improves accuracy of neural network basecallers for nanopore sequencing.

Genome biology
We develop a general computational approach for improving the accuracy of basecalling with Oxford Nanopore's 1D and related sequencing protocols. Our software PoreOver ( https://github.com/jordisr/poreover ) finds the consensus of two neural networks...

MethylationToActivity: a deep-learning framework that reveals promoter activity landscapes from DNA methylomes in individual tumors.

Genome biology
Although genome-wide DNA methylomes have demonstrated their clinical value as reliable biomarkers for tumor detection, subtyping, and classification, their direct biological impacts at the individual gene level remain elusive. Here we present Methyla...

An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding.

Genome biology
BACKGROUND: Transcription factor (TF) binding specificity is determined via a complex interplay between the transcription factor's DNA binding preference and cell type-specific chromatin environments. The chromatin features that correlate with transc...

A pitfall for machine learning methods aiming to predict across cell types.

Genome biology
Machine learning models that predict genomic activity are most useful when they make accurate predictions across cell types. Here, we show that when the training and test sets contain the same genomic loci, the resulting model may falsely appear to p...

SVFX: a machine learning framework to quantify the pathogenicity of structural variants.

Genome biology
There is a lack of approaches for identifying pathogenic genomic structural variants (SVs) although they play a crucial role in many diseases. We present a mechanism-agnostic machine learning-based workflow, called SVFX, to assign pathogenicity score...

Pathway information extracted from 25 years of pathway figures.

Genome biology
Thousands of pathway diagrams are published each year as static figures inaccessible to computational queries and analyses. Using a combination of machine learning, optical character recognition, and manual curation, we identified 64,643 pathway figu...

Mustache: multi-scale detection of chromatin loops from Hi-C and Micro-C maps using scale-space representation.

Genome biology
We present MUSTACHE, a new method for multi-scale detection of chromatin loops from Hi-C and Micro-C contact maps. MUSTACHE employs scale-space theory, a technical advance in computer vision, to detect blob-shaped objects in contact maps. MUSTACHE is...

Knowledge-primed neural networks enable biologically interpretable deep learning on single-cell sequencing data.

Genome biology
BACKGROUND: Deep learning has emerged as a versatile approach for predicting complex biological phenomena. However, its utility for biological discovery has so far been limited, given that generic deep neural networks provide little insight into the ...

DISC: a highly scalable and accurate inference of gene expression and structure for single-cell transcriptomes using semi-supervised deep learning.

Genome biology
Dropouts distort gene expression and misclassify cell types in single-cell transcriptome. Although imputation may improve gene expression and downstream analysis to some degree, it also inevitably introduces false signals. We develop DISC, a novel de...