SomaticSeq: An Ensemble and Machine Learning Method to Detect Somatic Mutations.

Journal: Methods in molecular biology (Clifton, N.J.)
Published Date:

Abstract

A standard strategy to discover somatic mutations in a cancer genome is to use next-generation sequencing (NGS) technologies to sequence the tumor tissue and its matched normal (commonly blood or adjacent normal tissue) for side-by-side comparison. However, when interrogating entire genomes (or even just the coding regions), the number of sequencing errors easily outnumbers the number of real somatic mutations by orders of magnitudes. Here, we describe SomaticSeq, which incorporates multiple somatic mutation detection algorithms and then uses machine learning to vastly improve the accuracy of the somatic mutation call sets.

Authors

  • Li Tai Fang
    Bina Technologies, Roche Sequencing, Redwood City, 94065, CA, USA. li\_tai.fang@bina.roche.com.