EnsDeepDP: An Ensemble Deep Learning Approach for Disease Prediction Through Metagenomics.

Journal: IEEE/ACM transactions on computational biology and bioinformatics
Published Date:

Abstract

A growing number of studies show that the human microbiome plays a vital role in human health and can be a crucial factor in predicting certain human diseases. However, microbiome data are often characterized by the limited samples and high-dimensional features, which pose a great challenge for machine learning methods. Therefore, this paper proposes a novel ensemble deep learning disease prediction method that combines unsupervised and supervised learning paradigms. First, unsupervised deep learning methods are used to learn the potential representation of the sample. Afterwards, the disease scoring strategy is developed based on the deep representations as the informative features for ensemble analysis. To ensure the optimal ensemble, a score selection mechanism is constructed, and performance boosting features are engaged with the original sample. Finally, the composite features are trained with gradient boosting classifier for health status decision. For case study, the ensemble deep learning flowchart has been demonstrated on six public datasets extracted from the human microbiome profiling. The results show that compared with the existing algorithms, our framework achieves better performance on disease prediction.

Authors

  • Yang Shen
    Departments of Electrical and Computer Engineering & Computer Science and Engineering Texas A&M University, College Station, TX 77840.
  • Jinlin Zhu
    State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi, P. R China.
  • Zhaohong Deng
    School of Digital Media, Jiangnan University, Wuxi, Jiangsu, China.
  • Wenwei Lu
    State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi, P. R China.
  • Hongchao Wang
    State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi, P. R China.