AIMC Topic: INDEL Mutation

Clear Filters Showing 11 to 20 of 22 articles

Decoding whole-genome mutational signatures in 37 human pan-cancers by denoising sparse autoencoder neural network.

Oncogene
Millions of somatic mutations have recently been discovered in cancer genomes. These mutations in cancer genomes occur due to internal and external mutagenesis forces. Decoding the mutational processes by examining their unique patterns has successfu...

Ranking of non-coding pathogenic variants and putative essential regions of the human genome.

Nature communications
A gene is considered essential if loss of function results in loss of viability, fitness or in disease. This concept is well established for coding genes; however, non-coding regions are thought less likely to be determinants of critical functions. H...

Optimized CRISPR guide RNA design for two high-fidelity Cas9 variants by deep learning.

Nature communications
Highly specific Cas9 nucleases derived from SpCas9 are valuable tools for genome editing, but their wide applications are hampered by a lack of knowledge governing guide RNA (gRNA) activity. Here, we perform a genome-scale screen to measure gRNA acti...

Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data.

Scientific reports
Crohn Disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machin...

A multi-task convolutional deep neural network for variant calling in single molecule sequencing.

Nature communications
The accurate identification of DNA sequence variants is an important, but challenging task in genomics. It is particularly difficult for single molecule sequencing, which has a per-nucleotide error rate of ~5-15%. Meeting this demand, we developed Cl...

A universal SNP and small-indel variant caller using deep neural networks.

Nature biotechnology
Despite rapid advances in sequencing technologies, accurately calling genetic variants present in an individual genome from billions of short, errorful sequence reads remains challenging. Here we show that a deep convolutional neural network can call...

An ensemble approach to accurately detect somatic mutations using SomaticSeq.

Genome biology
SomaticSeq is an accurate somatic mutation detection pipeline implementing a stochastic boosting algorithm to produce highly accurate somatic mutation calls for both single nucleotide variants and small insertions and deletions. The workflow currentl...

Do protein language models learn phylogeny?

Briefings in bioinformatics
Deep machine learning demonstrates a capacity to uncover evolutionary relationships directly from protein sequences, in effect internalising notions inherent to classical phylogenetic tree inference. We connect these two paradigms by assessing the ca...

SICaRiO: short indel call filtering with boosting.

Briefings in bioinformatics
Despite impressive improvement in the next-generation sequencing technology, reliable detection of indels is still a difficult endeavour. Recognition of true indels is of prime importance in many applications, such as personalized health care, diseas...

Ensemble-Based Somatic Mutation Calling in Cancer Genomes.

Methods in molecular biology (Clifton, N.J.)
Identification of somatic mutations in tumor tissue is challenged by both technical artifacts, diverse somatic mutational processes, and genetic heterogeneity in the tumors. Indeed, recent independent benchmark studies have revealed low concordance b...