VBayesMM: variational Bayesian neural network to prioritize important relationships of high-dimensional microbiome multiomics data.

Journal: Briefings in bioinformatics
Published Date:

Abstract

The analysis of high-dimensional microbiome multiomics datasets is crucial for understanding the complex interactions between microbial communities and host physiological states across health and disease conditions. Despite the importance of such analyses, current methods, such as the microbe-metabolite vectors approach, often struggle to predict metabolite abundances from microbial data and to identify keystone species. These difficulties arise from the vast dimensionality of metagenomics data, which complicates the inference of significant relationships, particularly the estimation of co-occurrence probabilities between microbes and metabolites. Here we propose the variational Bayesian microbiome multiomics (VBayesMM) approach, which aims to improve the prediction of metabolite abundances from microbial metagenomics data by incorporating a spike-and-slab prior within a Bayesian neural network. This prior allows VBayesMM to rapidly and precisely identify crucial microbial species, leading to more accurate estimates of co-occurrence probabilities between microbes and metabolites, while also robustly managing the uncertainty inherent in high-dimensional data. Moreover, we have implemented variational inference to address computational bottlenecks, enabling scalable analysis across extensive multiomics datasets. Our large-scale comparative evaluations demonstrate that VBayesMM not only outperforms existing methods in predicting metabolite abundances but also provides a scalable solution for analyzing massive datasets. VBayesMM enhances the interpretability of the Bayesian neural network by identifying a core set of influential microbial species, thus facilitating a deeper understanding of their probabilistic relationships with the host.
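
To make the idea of a spike-and-slab prior with variational inference more concrete, the following is a minimal NumPy sketch, not the VBayesMM implementation. It assumes a single weight matrix mapping microbes to metabolites, one inclusion gate per microbe, a point-mass spike at zero, a Gaussian slab, and a softmax over metabolites to turn weights into co-occurrence probabilities. All function names, shapes, and hyperparameters (prior_pi, prior_sigma) are hypothetical and chosen only for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(z, axis=-1):
    # Subtract the row maximum for numerical stability.
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def sample_weights(mu, log_sigma, logit_pi, rng):
    """Draw one weight matrix from a spike-and-slab variational posterior.

    mu, log_sigma: (n_microbes, n_metabolites) slab mean and log std.
    logit_pi:      (n_microbes,) logit of each microbe's inclusion probability.
    Assumption: one gate per microbe, shared across all its weights.
    """
    pi = sigmoid(logit_pi)
    gate = rng.random(pi.shape) < pi  # Bernoulli inclusion indicator
    slab = mu + np.exp(log_sigma) * rng.standard_normal(mu.shape)
    return slab * gate[:, None]       # excluded microbes get all-zero rows


def kl_spike_slab(mu, log_sigma, logit_pi, prior_pi=0.1, prior_sigma=1.0):
    """KL(q || p) with q = Bernoulli(pi) x N(mu, sigma^2) and
    p = Bernoulli(prior_pi) x N(0, prior_sigma^2), spike at zero in both."""
    pi = sigmoid(logit_pi)
    eps = 1e-12  # guards log(0) when pi saturates
    kl_bern = (pi * np.log((pi + eps) / prior_pi)
               + (1 - pi) * np.log((1 - pi + eps) / (1 - prior_pi)))
    sigma2 = np.exp(2 * log_sigma)
    kl_gauss = 0.5 * (sigma2 / prior_sigma**2 + mu**2 / prior_sigma**2
                      - 1 - 2 * log_sigma + 2 * np.log(prior_sigma))
    # The slab term only counts to the extent the microbe is included.
    return kl_bern.sum() + (pi[:, None] * kl_gauss).sum()


def cooccurrence_probs(weights):
    """Row-wise softmax: P(metabolite | microbe) under this sketch."""
    return softmax(weights, axis=1)
```

In a full variational treatment, the KL term above would be added to a negative expected log-likelihood of the observed metabolite abundances and the sum minimized over mu, log_sigma, and logit_pi. Microbes whose fitted inclusion probability stays high would then form the candidate set of influential species.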

Authors

  • Tung Dang
    Laboratory for Medical Science Mathematics, Department of Biological Sciences, School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan.
  • Artem Lysenko
    Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan.
  • Keith A Boroevich
    RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, 230-0045, Japan.
  • Tatsuhiko Tsunoda
    Center for Integrative Medical Sciences, RIKEN Yokohama, Yokohama, 230-0045, Japan. tatsuhiko.tsunoda@riken.jp.