Discrete Missing Data Imputation Using Multilayer Perceptron and Momentum Gradient Descent.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

Data are a strategic resource for industrial production, and an efficient data-mining process will increase productivity. However, there exist many missing values in data collected in real life due to various problems. Because the missing data may reduce productivity, missing value imputation is an important research topic in data mining. At present, most studies mainly focus on imputation methods for continuous missing data, while a few concentrate on discrete missing data. In this paper, a discrete missing value imputation method based on a multilayer perceptron (MLP) is proposed, which employs a momentum gradient descent algorithm, and some prefilling strategies are utilized to improve the convergence speed of the MLP. To verify the effectiveness of the method, experiments are conducted to compare the classification accuracy with eight common imputation methods, such as the mode, random, hot-deck, KNN, autoencoder, and MLP, under different missing mechanisms and missing proportions. Experimental results verify that the improved MLP model (IMLP) can effectively impute discrete missing values in most situations under three missing patterns.

Authors

  • Hu Pan
    School of Computer Science, Hubei University of Technology, Wuhan 430068, China.
  • Zhiwei Ye
    School of Computer Science, Hubei University of Technology, Wuhan 430068, China.
  • Qiyi He
    Department of Pharmaceutical Engineering, Faculty of Chemical Engineering and Light Industry, Guangdong University of Technology, Guangzhou 510006, People's Republic of China. Electronic address: chesto36@163.com.
  • Chunyan Yan
    School of Computer Science, Hubei University of Technology, Wuhan 430068, China.
  • Jianyu Yuan
    School of Computer Science, Hubei University of Technology, Wuhan 430068, China.
  • Xudong Lai
    School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430072, China.
  • Jun Su
    Department of Ultrasound, The Second Clinical Medical College, North Sichuan Medical College, Nan Chong, China.
  • Ruihan Li
    School of Computer Science, Hubei University of Technology, Wuhan 430068, China.