An integrated approach based on virtual data augmentation and deep neural networks modeling for VFA production prediction in anaerobic fermentation process.

Journal: Water research
Published Date:

Abstract

Data-driven models are suitable for simulating biological wastewater treatment processes with complex intrinsic mechanisms. However, raw data collected in the early stage of biological experiments are normally not enough to train data-driven models. In this study, an integrated modeling approach incorporating the random standard deviation sampling (RSDS) method and deep neural networks (DNNs) models, was established to predict volatile fatty acid (VFA) production in the anaerobic fermentation process. The RSDS method based on the mean values (x¯) and standard deviations (α) calculated from multiple experimental determination was initially developed for virtual data augmentation. The DNNs models were then established to learn features from virtual data and predict VFA production. The results showed that when 20000 virtual samples including five input variables of the anaerobic fermentation process were used to train the DNNs model with 16 hidden layers and 100 hidden neurons in each layer, the best correlation coefficient of 0.998 and the minimal mean absolute percentage error of 3.28% were achieved. This integrated approach can learn nonlinear information from virtual data generated by the RSDS method, and consequently enlarge the application range of DNNs models in simulating biological wastewater treatment processes with small datasets.

Authors

  • Run-Ze Xu
    Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, 210098, China.
  • Jia-Shun Cao
    Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, 210098, China; Guohe Environmental Research Institute (Nanjing) Co., Ltd, Nanjing, 211599, China.
  • Yang Wu
  • Su-Na Wang
    Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, 210098, China.
  • Jing-Yang Luo
    Key Laboratory of Integrated Regulation and Resource Development on Shallow Lakes, Ministry of Education, College of Environment, Hohai University, Nanjing, 210098, China; Guohe Environmental Research Institute (Nanjing) Co., Ltd, Nanjing, 211599, China.
  • Xueming Chen
    College of Environment and Resources, Fuzhou University, Fujian, 350116, China.
  • Fang Fang
    Department of Cardiology, Central War Zone General Hospital of the Chinese People's Liberation Army, Wuhan 430061, China.