A novel deep learning method for predictive modeling of microbiome data.

Journal: Briefings in bioinformatics
Published Date:

Abstract

With the development and decreasing cost of next-generation sequencing technologies, the study of the human microbiome has become a rapid expanding research field, which provides an unprecedented opportunity in various clinical applications such as drug response predictions and disease diagnosis. It is thus essential and desirable to build a prediction model for clinical outcomes based on microbiome data that usually consist of taxon abundance and a phylogenetic tree. Importantly, all microbial species are not uniformly distributed in the phylogenetic tree but tend to be clustered at different phylogenetic depths. Therefore, the phylogenetic tree represents a unique correlation structure of microbiome, which can be an important prior to improve the prediction performance. However, prediction methods that consider the phylogenetic tree in an efficient and rigorous way are under-developed. Here, we develop a novel deep learning prediction method MDeep (microbiome-based deep learning method) to predict both continuous and binary outcomes. Conceptually, MDeep designs convolutional layers to mimic taxonomic ranks with multiple convolutional filters on each convolutional layer to capture the phylogenetic correlation among microbial species in a local receptive field and maintain the correlation structure across different convolutional layers via feature mapping. Taken together, the convolutional layers with its built-in convolutional filters capture microbial signals at different taxonomic levels while encouraging local smoothing and preserving local connectivity induced by the phylogenetic tree. We use both simulation studies and real data applications to demonstrate that MDeep outperforms competing methods in both regression and binary classifications. Availability and Implementation: MDeep software is available at https://github.com/lichen-lab/MDeep Contact:chen61@iu.edu.

Authors

  • Ye Wang
    College of Computer Science and Technology, Jilin University, Changchun 130012, China.
  • Tathagata Bhattacharya
  • Yuchao Jiang
  • Xiao Qin
    Worcester Polytechnic Institute, 100 Institute Rd, Worcester, MA, 01609, USA.
  • Yue Wang
    Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States.
  • Yunlong Liu
    Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202, USA.
  • Andrew J Saykin
    Indiana University, Indianapolis, IN 46202, USA.
  • Li Chen
    Department of Endocrinology and Metabolism, Qilu Hospital, Shandong University, Jinan, China.