Bacterial classification with convolutional neural networks based on different data reduction layers.

Journal: Nucleosides, nucleotides & nucleic acids
Published Date:

Abstract

For high accuracy classification of DNA sequences through Convolutional Neural Networks (CNNs), it is essential to use an efficient sequence representation that can accelerate similarity comparison between DNA sequences. In addition, CNN networks can be improved by avoiding the dimensionality problem associated with multi-layer CNN features. This paper presents a new approach for classification of bacterial DNA sequences based on a custom layer. A CNN is used with Frequency Chaos Game Representation (FCGR) of DNA. The FCGR is adopted as a sequence representation method with a suitable choice of the frequency k-lengthen words occurrence in DNA sequences. The DNA sequence is mapped using FCGR that produces an image of a gene sequence. This sequence displays both local and global patterns. A pre-trained CNN is built for image classification. First, the image is converted to feature maps through convolutional layers. This is sometimes followed by a down-sampling operation that reduces the spatial size of the feature map and removes redundant spatial information using the pooling layers. The Random Projection (RP) with an activation function, which carries data with a decent variety with some randomness, is suggested instead of the pooling layers. The feature reduction is achieved while keeping the high accuracy for classifying bacteria into taxonomic levels. The simulation results show that the proposed CNN based on RP has a trade-off between accuracy score and processing time.

Authors

  • Samia M Abd-Alhalem
    Department of Electronics and Electrical Communications Engineering, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
  • Naglaa F Soliman
    Electronics and Communication Department Faculty of Engineering, Zagazig University, Zagazig, Egypt.
  • Salah Eldin
  • S E Abd Elrahman
    department of Computer Science and Engineering, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
  • Nabil A Ismail
    department of Computer Science and Engineering, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
  • El-Sayed M El-Rabaie
    Department of Electronics and Electrical Communications Engineering, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
  • Fathi E Abd El-Samie
    Department of Electronics and Electrical Communications Engineering, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.