ConsensuSV-ONT - A modern method for accurate structural variant calling.

Journal: Scientific reports
Published Date:

Abstract

Improvements in sequencing technology make the development of new tools for detection of structural variance more and more common. However, since the tools available for the long-read Oxford Nanopore sequencing are limited, and the selection of the optimal tool presents a challenge, there is a need to create a tool based on Consensus that combines existing work to identify a set of high-quality, reliable structural variants that can be used for further downstream analysis. The field has also been subject to revolution in machine learning techniques, especially deep learning. To address the aforementioned need and developments, we propose a novel, fully automated ConsensuSV-ONT algorithm. The method uses six independent, state-of-the-art structural variant callers for long-read sequencing along with a convolutional neural network for filtering high-quality variants. We provide a runtime environment in the form of a docker image, wrapping a nextflow pipeline for efficient processing using parallel computing. The solution is complete in its form and is ready to use not only by computer scientists but accessible and easy to use for everyone working with Oxford Nanopore long-read sequencing data.

Authors

  • Antoni Pietryga
    Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland.
  • Mateusz Chiliński
    Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland.
  • Sachin Gadakh
    Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland.
  • Dariusz Plewczynski
    Center of New Technologies, University of Warsaw, 02097 Warszawa, Poland.