Using Machine Learning to Identify True Somatic Variants from Next-Generation Sequencing.
Journal:
Clinical chemistry
Published Date:
Jan 1, 2020
Abstract
BACKGROUND: Molecular profiling has become essential for tumor risk stratification and treatment selection. However, cancer genome complexity and technical artifacts make identification of real variants a challenge. Currently, clinical laboratories rely on manual screening, which is costly, subjective, and not scalable. We present a machine learning-based method to distinguish artifacts from bona fide single-nucleotide variants (SNVs) detected by next-generation sequencing from nonformalin-fixed paraffin-embedded tumor specimens.