Prediction of anaerobic digestion performance and identification of critical operational parameters using machine learning algorithms.

Journal: Bioresource technology
Published Date:

Abstract

Machine learning has emerges as a novel method for model development and has potential to be used to predict and control the performance of anaerobic digesters. In this study, several machine learning algorithms were applied in regression and classification models on digestion performance to identify determinant operational parameters and predict methane production. In the regression models, k-nearest neighbors (KNN) algorithm demonstrates optimal prediction accuracy (root mean square error = 26.6, with the dataset range of 259.0-573.8), after narrowing prediction coverage by excluding extreme outliers from the validation set. In the classification models, logistic regression multiclass algorithm yields the best prediction accuracy of 0.73. Feature importance reveals that total carbon was the determinant operational parameter. These results demonstrate the great potential of using machine learning algorithms to predict anaerobic digestion performance.

Authors

  • Luguang Wang
    Department of Biological and Ecological Engineering, Oregon State University, Corvallis, OR 97333, USA.
  • Fei Long
    Department of Biological and Ecological Engineering, Oregon State University, Corvallis, OR 97333, USA.
  • Wei Liao
    Department of Surgery, Hepatic Surgery Center, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Hong Liu
    Key Laboratory of Grain and Oil Processing and Food Safety of Sichuan Province, College of Food and Bioengineering, Xihua University Chengdu 610039 China xingyage1@163.com.