NmTHC: a hybrid error correction method based on a generative neural machine translation model with transfer learning.

Journal: BMC genomics
PMID:

Abstract

BACKGROUNDS: The single-pass long reads generated by third-generation sequencing technology exhibit a higher error rate. However, the circular consensus sequencing (CCS) produces shorter reads. Thus, it is effective to manage the error rate of long reads algorithmically with the help of the homologous high-precision and low-cost short reads from the Next Generation Sequencing (NGS) technology.

Authors

  • Rongshu Wang
    Department of Electronic Engineering, Information School, Yunnan University, Kunming, Yunnan, China.
  • Jianhua Chen
    Department of Electronic Engineering, Information School, Yunnan University, Kunming, Yunnan 650091, China.