IC neuron: An efficient unit to construct neural networks.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

As a popular machine learning method, neural networks can be used to solve many complex tasks. Their strong generalization ability comes from the representation ability of the basic neuron models. The most popular neuron model is the McCulloch-Pitts (MP) neuron, which uses a simple transformation to process the input signal. A common trend is to use the MP neuron to design various neural networks. However, the optimization of the neuron structure is rarely considered. Inspired by the elastic collision model in physics, we propose a new neuron model that can represent more complex distributions. We term it the Inter-layer Collision (IC) neuron which divides the input space into multiple subspaces to represent different linear transformations. Through this operation, the IC neuron enhances the non-linear representation ability and emphasizes useful input features for a given task. We build the IC networks by integrating the IC neurons into the fully-connected, the convolutional, and the recurrent structures. The IC networks outperform the traditional neural networks in a wide range of tasks. Besides, we combine the IC neuron with deep learning models and show the superiority of the IC neuron. Our research proves that the IC neuron can be an effective basic unit to build network structures and make the network performance better.

Authors

  • Junyi An
    State Key Laboratory for Novel Software Technology, Nanjing University, China; Department of Computer Science and Technology, Nanjing University, China. Electronic address: dz1833001@smail.nju.edu.cn.
  • Fengshan Liu
    State Key Laboratory for Novel Software Technology, Nanjing University, China; Department of Computer Science and Technology, Nanjing University, China. Electronic address: fengshanliuf@gmail.com.
  • Furao Shen
  • Jian Zhao
    Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled (Changchun University), Ministry of Education, Changchun University, Changchun 130012, China.
  • Ruotong Li
    State Key Laboratory for Novel Software Technology, Nanjing University, China; School of Artificial Intelligence, Nanjing University, China. Electronic address: 181220030@smail.nju.edu.cn.
  • Kepan Gao
    State Key Laboratory for Novel Software Technology, Nanjing University, China; Department of Computer Science and Technology, Nanjing University, China. Electronic address: gaokp@smail.nju.edu.cn.