AIMC Topic: High-Throughput Nucleotide Sequencing

Clear Filters Showing 131 to 140 of 341 articles

Improving protein domain classification for third-generation sequencing reads using deep learning.

BMC genomics
BACKGROUND: With the development of third-generation sequencing (TGS) technologies, people are able to obtain DNA sequences with lengths from 10s to 100s of kb. These long reads allow protein domain annotation without assembly, thus can produce impor...

Optimal Bayesian Transfer Learning for Count Data.

IEEE/ACM transactions on computational biology and bioinformatics
There is often a limited amount of omics data to design predictive models in biomedicine. Knowing that these omics data come from underlying processes that may share common pathways and disease mechanisms, it may be beneficial for designing a more ac...

DCMD: Distance-based classification using mixture distributions on microbiome data.

PLoS computational biology
Current advances in next-generation sequencing techniques have allowed researchers to conduct comprehensive research on the microbiome and human diseases, with recent studies identifying associations between the human microbiome and health outcomes f...

DeepTCR is a deep learning framework for revealing sequence concepts within T-cell repertoires.

Nature communications
Deep learning algorithms have been utilized to achieve enhanced performance in pattern-recognition tasks. The ability to learn complex patterns in data has tremendous implications in immunogenomics. T-cell receptor (TCR) sequencing assesses the diver...

seqQscorer: automated quality control of next-generation sequencing data using machine learning.

Genome biology
Controlling quality of next-generation sequencing (NGS) data files is a necessary but complex task. To address this problem, we statistically characterize common NGS quality features and develop a novel quality control procedure involving tree-based ...

Use of machine learning to identify a T cell response to SARS-CoV-2.

Cell reports. Medicine
The identification of SARS-CoV-2-specific T cell receptor (TCR) sequences is critical for understanding T cell responses to SARS-CoV-2. Accordingly, we reanalyze publicly available data from SARS-CoV-2-recovered patients who had low-severity disease ...

Machine learning reveals bilateral distribution of somatic L1 insertions in human neurons and glia.

Nature neuroscience
Retrotransposons can cause somatic genome variation in the human nervous system, which is hypothesized to have relevance to brain development and neuropsychiatric disease. However, the detection of individual somatic mobile element insertions present...

Cnngeno: A high-precision deep learning based strategy for the calling of structural variation genotype.

Computational biology and chemistry
Genotype plays a significant role in determining characteristics in an organism and genotype calling has been greatly accelerated by sequencing technologies. Furthermore, most parametric statistical models are unable to effectively call genotype, whi...

Deep learning predicts short non-coding RNA functions from only raw sequence data.

PLoS computational biology
Small non-coding RNAs (ncRNAs) are short non-coding sequences involved in gene regulation in many biological processes and diseases. The lack of a complete comprehension of their biological functionality, especially in a genome-wide scenario, has dem...