Low-degree term first in ResNet, its variants and the whole neural network family.
Journal:
Neural Networks: The Official Journal of the International Neural Network Society
Published Date:
Jan 24, 2022
Abstract
To explain the working mechanism of ResNet and its variants, this paper proposes a novel argument, shallow subnetwork first (SSF), which is essentially low-degree term first (LDTF) and also applies to the whole neural network family. A neural network with shortcut connections behaves as an ensemble of subnetworks of differing depths. Among these subnetworks, the shallow ones are trained first and have a strong effect on the performance of the network. The shallow subnetworks roughly correspond to low-degree polynomial terms, while the deep subnetworks correspond to high-degree terms; viewed through Taylor expansion, SSF is therefore consistent with LDTF. ResNet is thus in line with Taylor expansion: shallow subnetworks are trained first to capture the low-degree terms, which avoids overfitting, while deep subnetworks maintain the high-degree terms, which ensures high descriptive capacity. Experiments on ResNets and DenseNets show that shallow subnetworks are trained first and play important roles in the training of these networks. The experiments also reveal why DenseNets outperform ResNets: the subnetworks that play vital roles in training the former are shallower than those in the latter. Furthermore, LDTF also explains the working mechanism of other ResNet variants (SE-ResNets and SK-ResNets) and common phenomena observed across many neural networks.
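
To illustrate the ensemble-of-subnetworks view described in the abstract, the sketch below (not taken from the paper) assumes purely linear residual branches so that the expansion is exact: a two-block residual network unrolls into paths of depth 0, 1, and 2, the analogue of low- and high-degree terms. The weights and dimensions are arbitrary toy values.

```python
# Minimal sketch: with linear residual branches f_i(x) = W_i @ x, a
# two-block residual network expands exactly into a sum of 2^2 = 4 paths
# ("subnetworks") of depths 0, 1, 1, 2. Deeper paths multiply more weight
# matrices, the analogue of higher-degree terms in a Taylor expansion.
import numpy as np

rng = np.random.default_rng(0)
d = 4
W1 = 0.1 * rng.standard_normal((d, d))   # first residual branch (toy weights)
W2 = 0.1 * rng.standard_normal((d, d))   # second residual branch (toy weights)
x = rng.standard_normal(d)

# Residual network: y = x + W1 x, then z = y + W2 y
y = x + W1 @ x
z = y + W2 @ y

# Unrolled "ensemble of subnetworks" view of the same computation
path_depth0 = x                     # identity path (depth 0)
path_depth1 = W1 @ x + W2 @ x       # two paths through exactly one block
path_depth2 = W2 @ (W1 @ x)         # path through both blocks (depth 2)

# The sum of all paths reproduces the network output exactly
assert np.allclose(z, path_depth0 + path_depth1 + path_depth2)
```

With n residual blocks the same expansion yields 2^n paths; in the paper's terms, the shallow (low-depth) paths are the low-degree terms that are trained first, while nonlinear branches make the decomposition approximate rather than exact.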