tarSVM: Improving the accuracy of variant calls derived from microfluidic PCR-based targeted next generation sequencing using a support vector machine.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Targeted sequencing of discrete gene sets is a cost effective strategy to screen subjects for monogenic forms of disease. One method to achieve this pairs microfluidic PCR with next generation sequencing. The PCR step of this pipeline creates challenges in accurate variant calling. This includes that most reads targeting a specific exon are duplicates that have been amplified from the PCR step. To reduce false positive variant calls from these experiments, previous studies have used threshold-based filtering of alternative allele depth ratio and manual inspection of the alignments. However even after manual inspection and filtering, many variants fail to be validated via Sanger sequencing. To improve the accuracy of variant calling from these experiments, we are challenged to design a variant filtering strategy that sufficiently models microfluidic PCR-specific issues.

Authors

  • Christopher E Gillies
    Department of Pediatrics-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA.
  • Edgar A Otto
    Department of Internal Medicine-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA.
  • Virginia Vega-Warner
    Department of Pediatrics-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA.
  • Catherine C Robertson
    Department of Pediatrics-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA.
  • Simone Sanna-Cherchi
    Department of Medicine, Columbia University College of Physicians and Surgeons, New York, NY, USA.
  • Ali Gharavi
    Department of Medicine, Columbia University College of Physicians and Surgeons, New York, NY, USA.
  • Brendan Crawford
    Department of Pediatrics-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA.
  • Rajendra Bhimma
    Department of Paediatrics and Child Health, University of KwaZulu Natal, Durban, South Africa.
  • Cheryl Winkler
    NCI, Frederick National Lab for Cancer Research, Molecular Genetics Epidemiology Section, Frederick, MD, USA.
  • Hyun Min Kang
    Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA.
  • Matthew G Sampson
    Department of Pediatrics-Nephrology, University of Michigan School of Medicine, Ann Arbor, MI, USA. mgsamps@med.umich.edu.