Human host status inference from temporal microbiome changes via recurrent neural networks.

Journal: Briefings in bioinformatics
Published Date:

Abstract

With the rapid increase in sequencing data, human host status inference (e.g. healthy or sick) from microbiome data has become an important issue. Existing studies are mostly based on single-point microbiome composition, while it is rare that the host status is predicted from longitudinal microbiome data. However, single-point-based methods cannot capture the dynamic patterns between the temporal changes and host status. Therefore, it remains challenging to build good predictive models as well as scaling to different microbiome contexts. On the other hand, existing methods are mainly targeted for disease prediction and seldom investigate other host statuses. To fill the gap, we propose a comprehensive deep learning-based framework that utilizes longitudinal microbiome data as input to infer the human host status. Specifically, the framework is composed of specific data preparation strategies and a recurrent neural network tailored for longitudinal microbiome data. In experiments, we evaluated the proposed method on both semi-synthetic and real datasets based on different sequencing technologies and metagenomic contexts. The results indicate that our method achieves robust performance compared to other baseline and state-of-the-art classifiers and provides a significant reduction in prediction time.

Authors

  • Xingjian Chen
    School of Information Science and Technology, Nanjing Agricultural University, Nanjing 210095, Jiangsu, China.
  • Lingjing Liu
    Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong SAR.
  • Weitong Zhang
    Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong SAR.
  • Jianyi Yang
    School of Mathematical Sciences, Nankai University, Tianjin, China.
  • Ka-Chun Wong