Understanding neural network through neuron level visualization.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Neurons are the fundamental units of neural networks. In this paper, we propose a method for explaining neural networks by visualizing the learning process of neurons. For a trained neural network, the proposed method obtains the features learned by each neuron and displays the features in a human-understandable form. The features learned by different neurons are combined to analyze the working mechanism of different neural network models. The method is applicable to neural networks without requiring any changes to the architectures of the models. In this study, we apply the proposed method to both Fully Connected Networks (FCNs) and Convolutional Neural Networks (CNNs) trained using the backpropagation learning algorithm. We conduct experiments on models for image classification tasks to demonstrate the effectiveness of the method. Through these experiments, we gain insights into the working mechanisms of various neural network architectures and evaluate neural network interpretability from diverse perspectives.

Authors

  • Hui Dou
    State Key Laboratory for Novel Software Technology, China; Department of Computer Science and Technology, Nanjing University, Nanjing 210023, China. Electronic address: huidou@smail.nju.edu.cn.
  • Furao Shen
  • Jian Zhao
    Key Laboratory of Intelligent Rehabilitation and Barrier-Free for the Disabled (Changchun University), Ministry of Education, Changchun University, Changchun 130012, China.
  • Xinyu Mu
    State Key Laboratory for Novel Software Technology, China; School of Artificial Intelligence, Nanjing University, Nanjing 210023, China.