DL-CNV: A deep learning method for identifying copy number variations based on next generation target sequencing.

Journal: Mathematical biosciences and engineering : MBE

Published Date: Sep 30, 2019

Abstract

Copy number variations (CNVs) play an important role in many types of cancer. With the rapid development of next generation sequencing (NGS) techniques, many methods for detecting CNVs of a single sample have emerged: (i) require genome-wide data of both case and control samples, (ii) depend on sequencing depth and GC content correction algorithm, (iii) rely on statistical models built on CNV positive and negative sample datasets. These make them costly in the data analysis and ineffective in the targeted sequencing data. In this study, we developed a novel alignment-free method called DL-CNV to call CNV from the target sequencing data of a single sample. Specifically, we collected two sets of samples. The first set consists of 1301 samples, in which 272 have CNVs in ERBB2 and the second set is composed of 1148 samples with 63 samples containing CNVs in MET. Finally, we found that a testing AUC of 0.9454 for ERBB2 and 0.9220 for MET. Furthermore, we hope to make the CNV detection could be more accurate with clinical "gold standard" (e.g. FISH) information and provide a new research direction, which can be used as the supplement to the existing NGS methods.

Authors

Yun Xiang Zhang

Weifang People's Hospital, Guang Wen Road, Weifang 261000, China.
Lv Cheng Jin

Weifang Medical University, Bao Tong West Street, Weifang 261053, China.
Bo Wang

Department of Clinical Laboratory Medicine Center, Inner Mongolia Autonomous Region People's Hospital, Hohhot, Inner Mongolia, China.
De Hong Hu

Weifang People's Hospital, Guang Wen Road, Weifang 261000, China.
Le Qiang Wang

Weifang People's Hospital, Guang Wen Road, Weifang 261000, China.
Pan Li

Department of Infections，Beijing Hospital of Traditional Chinese Medicine, Affiliated to the Capital Medical University, No. 23, Back Road of the Art Gallery, Dongcheng District, Beijing 100010, China.
Jun Ling Zhang

Geneis Beijing Limited Company, Beijing 100102, China.
Kai Han

Geneis Beijing Limited Company, Beijing 100102, China.
Geng Tian

Department of Sciences, Genesis (Beijing) Co. Ltd., Beijing, China.
Da Wei Yuan

Geneis Beijing Limited Company, Beijing 100102, China.
Jia Liang Yang

Geneis Beijing Limited Company, Beijing 100102, China.
Wei Tan

Weifang People's Hospital, Guang Wen Road, Weifang 261000, China.
Xiao Ming Xing

The Affiliated Hospital of Qingdao University, Jiang Su Road, Qingdao 266071, China.
Ji Dong Lang

The Affiliated Hospital of Qingdao University, Jiang Su Road, Qingdao 266071, China.

Keywords

Algorithms Area Under Curve Carcinoma, Non-Small-Cell Lung Databases, Factual Deep Learning DNA Copy Number Variations Exons False Positive Reactions Genome-Wide Association Study Genome, Human High-Throughput Nucleotide Sequencing Humans In Situ Hybridization, Fluorescence Lung Neoplasms Proto-Oncogene Proteins c-met Receptor, ErbB-2 Reproducibility of Results ROC Curve Sensitivity and Specificity

External Resources

View on PubMed Access via DOI PubMed (31731347)

DL-CNV: A deep learning method for identifying copy number variations based on next generation target sequencing.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals