Cnngeno: A high-precision deep learning based strategy for the calling of structural variation genotype.

Journal: Computational biology and chemistry
Published Date:

Abstract

Genotype plays a significant role in determining characteristics in an organism and genotype calling has been greatly accelerated by sequencing technologies. Furthermore, most parametric statistical models are unable to effectively call genotype, which is influenced by the size of structural variations and the coverage fluctuations of sequencing data. In this study, we propose a new method for calling deletions' genotypes from the next-generation data, called Cnngeno. Cnngeno can convert sequencing data into images and classifies the genotypes from these images using the convolutional neural network(CNN). Moreover, Cnngeno adopted the convolutional bootstrapping strategy to improve the anti-noisy label's ability. The results show that Cnngeno performs better in terms of precision for calling genotype when compared with other existing methods. The Cnngeno is an open-source method, available at https://github.com/BRF123/Cnngeno.

Authors

  • Ruofei Bai
    Department of Computer Science and Technology, Beijing University of Chemical Technology, Beijing, China.
  • Cheng Ling
    Department of Computer Science and Technology, Beijing University of Chemical Technology, Beijing, China.
  • Lei Cai
    Department of Information Science and Technology, Beijing University of Chemical Technology, Beijing, People's Republic of China.
  • Jingyang Gao
    Department of Computer Science and Technology, Beijing University of Chemical Technology, Beijing, China.