Impact of microbial profile integration on machine learning predictions of methane production: synergies and trade-offs with physicochemical parameters.

Journal: Bioresource technology
Published Date:

Abstract

Microbial sequencing data were rarely integrated into the prediction of methane production using machine learning (ML) models because of high dimensionality and the lack of a systematic way to evaluate the change of insight gained from modelling with only physicochemical information. Here, key taxa were extracted with co-occurrence network analysis to reduce the dimension of the microbial profile. With 101 datasets with paired microbial and physiochemical features, integrating microbial features significantly enhanced accuracy for predicting methane production, increasing average R from 0.73 to 0.79 and reducing mean absolute error from 8.7 to 8.0. Notably, integrating microbial features altered physicochemical feature impacts, shifting both their importance and directional effects. This underscores how microbial data refine mechanistic understanding and synergistically improve prediction accuracy, addressing a key gap left by models relying solely on physicochemical parameters. The work advocates for systematic microbial feature inclusion to advance methane production modelling with ML frameworks.

Authors

  • Hongyu Dang
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Najiaowa Yu
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Anqi Mou
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Yingdi Zhang
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Huijuan Sun
    Jinzhou First People's Hospital, Dalian, China.
  • Mengjiao Gao
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Lei Zhang
    Division of Gastroenterology, Union Hospital, Tongji Medical College Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Yiyang Yuan
    Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Alberta T6G 1H9, Canada.
  • Huichun Zhang
    Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, Ohio 44106, United States.
  • Yang Liu
    Department of Computer Science, Hong Kong Baptist University, Hong Kong, China.