An interpretive constrained linear model for ResNet and MgNet.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

We propose a constrained linear data-feature-mapping model as an interpretable mathematical model for image classification using a convolutional neural network (CNN). From this viewpoint, we establish detailed connections between the traditional iterative schemes for linear systems and the architectures of the basic blocks of ResNet- and MgNet-type models. Using these connections, we present some modified ResNet models that, compared with the original models, have fewer parameters but can produce more accurate results, thereby demonstrating the validity of this constrained learning data-feature-mapping assumption. Based on this assumption, we further propose a general data-feature iterative scheme to demonstrate the rationality of MgNet. We also provide a systematic numerical study on MgNet to show its success and advantages in image classification problems, particularly in comparison with established networks.

Authors

  • Juncai He
    Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia. Electronic address: juncai.he@kaust.edu.sa.
  • Jinchao Xu
    Department of Mathematics, Pennsylvania State University, State College, Pennsylvania.
  • Lian Zhang
    Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA.
  • Jianqing Zhu
    Engineering Research Centre in Industrial Intellectual Techniques and Systems of Fujian Providence College of Engineering, Huaqiao University, Chenghua North Road, Fengze District, Quanzhou, Fujian 362021, China. Electronic address: jqzhu@hqu.edu.cn.