MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.

Journal: BMC Bioinformatics
PMID:

Abstract

BACKGROUND: The rapid advancement of next-generation sequencing (NGS) machines in terms of speed and affordability has led to the generation of massive amounts of biological data, but at the expense of data quality, as errors have become more prevalent. This creates a need for approaches that detect and filter errors, and it moves data quality assurance from the hardware space to the software preprocessing stages.

Authors

  • Amira Sami
    Department of Computer Science, Faculty of Computers and Information, Mansoura University, P.O. Box: 35516, Mansoura, Egypt.
  • Sara El-Metwally
    Department of Computer Science, Faculty of Computers and Information, Mansoura University, P.O. Box: 35516, Mansoura, Egypt. sarah_almetwally4@mans.edu.eg.
  • M Z Rashad
    Computer Science Department, Faculty of Computers and Information, Mansoura University, Mansoura, Dakahlia Governorate, Egypt.