FORCE trained spiking networks do not benefit from faster learning while parameter matched rate networks do
Journal:
bioRxiv
Published Date:
Jan 1, 2025
Abstract
Training spiking recurrent neural networks (SRNNs) presents significant challenges compared to standard recurrent neural networks (RNNs) that model neural firing rates more directly. Here, we investigate the origins of these difficulties by training networks of spiking neurons and their parameter-matched instantaneous rate-based RNNs on supervised learning tasks. We applied FORCE training to leaky integrate-and-fire spiking networks and their matched rate-based counterparts across various dynamical tasks, keeping the FORCE hyperparameters identical. We found that at slow learning rates, spiking and rate networks behaved similarly: FORCE training identified highly correlated weight matrix solutions, and both network types exhibited overlapping hyperparameter regions for successful convergence. Remarkably, these weight solutions were largely interchangeable—weights trained in the spiking network could be transferred to the rate network and vice versa while preserving correct dynamical decoding. However, at fast learning rates, the correlation between learned solutions dropped sharply, and the solutions were no longer fully interchangeable. Despite this, rate networks still functioned well when their weight matrices were replaced with those learned from spiking networks. Additionally, the two network types exhibited distinct behaviours across different sizes: faster learning improved performance in rate networks but had little effect in spiking networks, aside from increasing instability. Through analytic derivation, we further show that slower learning rates in FORCE effectively act as a low-pass filter on the principal components of the neural bases, selectively stabilizing the dominant correlated components across spiking and rate networks. Our results indicate that some of the difficulties in training spiking networks stem from the inherent spike-time variability in spiking systems—variability that is not present in rate networks. These challenges can be mitigated in FORCE training by selecting appropriately slow learning rates. Moreover, our findings suggest that the decoding solutions learned by FORCE for spiking networks approximate a cross-trial firing rate-based decoding. Training spiking neural networks is much harder compared to training standard recurrent neural networks that are more closely tied to neural firing rates. To understand why, we trained parameter matched spiking and rate-based networks on the same supervised learning tasks with the FORCE technique. We found that the learned spike weights were highly correlated and interchangeable across spiking and firing rate networks for slow learning rates. However, when both networks learn fast, the spiking networks show no tangible improvements in their performance in comparison to the rate networks, with instabilities caused by faster learning in the spiking network. These networks also discover uncorrelated solutions to their weights when the learning is fast, that are only interchangeable in one direction, from spike to rate. This suggests that the decoding solutions learned by FORCE for spiking networks approximate a cross-trial firing rate-based decoding. We then analytically determine that, with slower learning rates, FORCE acts as a low-pass filter on the principal components of the neural bases, where the leading components are highly correlated across spiking and rate networks.