Directly training temporal Spiking Neural Network with sparse surrogate gradient.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Brain-inspired Spiking Neural Networks (SNNs) have attracted much attention due to their event-based computing and energy-efficient features. However, the spiking all-or-none nature has prevented direct training of SNNs for various applications. The surrogate gradient (SG) algorithm has recently enabled spiking neural networks to shine in neuromorphic hardware. However, introducing surrogate gradients has caused SNNs to lose their original sparsity, thus leading to the potential performance loss. In this paper, we first analyze the current problem of direct training using SGs and then propose Masked Surrogate Gradients (MSGs) to balance the effectiveness of training and the sparseness of the gradient, thereby improving the generalization ability of SNNs. Moreover, we introduce a temporally weighted output (TWO) method to decode the network output, reinforcing the importance of correct timesteps. Extensive experiments on diverse network structures and datasets show that training with MSG and TWO surpasses the SOTA technique.

Authors

  • Yang Li
    Occupation of Chinese Center for Disease Control and Prevention, Beijing, China.
  • Feifei Zhao
    Brain-inspired Cognitive Intelligence Lab, Institute of Automation, Chinese Academy of Sciences (CAS), Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China. Electronic address: zhaofeifei2014@ia.ac.cn.
  • Dongcheng Zhao
    Research Center for Brain-Inspired Intelligence, Institute of Automation, Chinese Academy of Sciences, Beijing, China. Electronic address: zhaodongcheng2016@ia.ac.cn.
  • Yi Zeng
    Department of Geriatrics, Second Xiangya Hospital, Central South University, Changsha, 410011, Hunan, China.