A deep auto-encoder model for gene expression prediction.

Journal: BMC genomics
Published Date:

Abstract

BACKGROUND: Gene expression is a key intermediate level that genotypes lead to a particular trait. Gene expression is affected by various factors including genotypes of genetic variants. With an aim of delineating the genetic impact on gene expression, we build a deep auto-encoder model to assess how good genetic variants will contribute to gene expression changes. This new deep learning model is a regression-based predictive model based on the MultiLayer Perceptron and Stacked Denoising Auto-encoder (MLP-SAE). The model is trained using a stacked denoising auto-encoder for feature selection and a multilayer perceptron framework for backpropagation. We further improve the model by introducing dropout to prevent overfitting and improve performance.

Authors

  • Rui Xie
    Department of Statistics and Data Science, University of Central Florida, Orlando, FL 32816 USA.
  • Jia Wen
    Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, University City Blvd, Charlotte, NC, USA.
  • Andrew Quitadamo
    Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, University City Blvd, Charlotte, NC, USA.
  • Jianlin Cheng
    Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.
  • Xinghua Shi