GSEnet: feature extraction of gene expression data and its application to Leukemia classification.

Journal: Mathematical biosciences and engineering : MBE
Published Date:

Abstract

Gene expression data is highly dimensional. As disease-related genes account for only a tiny fraction, a deep learning model, namely GSEnet, is proposed to extract instructive features from gene expression data. This model consists of three modules, namely the pre-conv module, the SE-Resnet module, and the SE-conv module. Effectiveness of the proposed model on the performance improvement of 9 representative classifiers is evaluated. Seven evaluation metrics are used for this assessment on the GSE99095 dataset. Robustness and advantages of the proposed model compared with representative feature selection methods are also discussed. Results show superiority of the proposed model on the improvement of the classification precision and accuracy.

Authors

  • Kun Yu
    College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, Liaoning 110819, China.
  • Mingxu Huang
    School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning 110819, China.
  • Shuaizheng Chen
    School of Computer Science and Engineering, Northeastern University, Shenyang, Liaoning 110819, China.
  • Chaolu Feng
    Key Laboratory of Intelligent Computing in Medical Image, Ministry of Education, Shenyang, Liaoning 110819, China.
  • Wei Li
    Department of Nephrology, The Second Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi, China.