Ensemble stacking rockburst prediction model based on Yeo-Johnson, K-means SMOTE, and optimal rockburst feature dimension determination.

Journal: Scientific reports
Published Date:

Abstract

Rockburst forecasting plays a crucial role in prevention and control of rockburst disaster. To improve the accuracy of rockburst prediction at the data structure and algorithm levels, the Yeo-Johnson transform, K-means SMOTE oversampling, and optimal rockburst feature dimension determination are used to optimize the data structure. At the algorithm optimization level, ensemble stacking rockburst prediction is performed based on the data structure optimization. First, to solve the problem of many outliers and data imbalance in the distribution of rockburst data, the Yeo-Johnson transform and k-means SMOTE algorithm are respectively used to solve the problems. Then, based on six original rockburst features, 21 new features are generated using the PolynomialFeatures function in Sklearn. Principal component analysis (PCA) dimensionality reduction is applied to eliminate the correlations between the 27 features. Thirteen types of machine learning algorithms are used to predict datasets that retain different numbers of features after dimensionality reduction to determine the optimal rockburst feature dimension. Finally, the 14-feature rockburst dataset is used as the input for integrated stacking. The results show that the ensemble stacking model based on Yeo-Johnson, K-means SMOTE, and optimal rockburst feature dimension determination can improve the accuracy of rockburst prediction by 0.1602-0.3636. Compared with the 13 single machine learning models without data preprocessing, this data structure optimization and algorithm optimization method effectively improves the accuracy of rockburst prediction.

Authors

  • Lijun Sun
    Media Laboratory, Systems, & Society, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • Nanyan Hu
    School of Resource and Environmental Engineering, Wuhan University of Science and Technology, Wuhan, 430081, Hubei, People's Republic of China.
  • Yicheng Ye
    School of Resource and Environmental Engineering, Wuhan University of Science and Technology, Wuhan, 430081, Hubei, People's Republic of China.
  • Wenkan Tan
    School of Resources and Environmental Engineering, Wuhan University of Science and Technology, Wuhan, 430081, Hubei, China.
  • Menglong Wu
    School of Resources and Environmental Engineering, Wuhan University of Science and Technology, Wuhan, 430081, Hubei, China.
  • Xianhua Wang
    Department of Quality Control of Changji Autonomous Prefecture Center for Disease Control and Prevention, 831100, China.
  • Zhaoyun Huang
    Hubei Jingshen Safety Technology Co., Ltd., Yichang, 443000, Hubei, China.