SeReNe: Sensitivity-Based Regularization of Neurons for Structured Sparsity in Neural Networks.

Journal: IEEE transactions on neural networks and learning systems
Published Date:

Abstract

Deep neural networks include millions of learnable parameters, making their deployment over resource-constrained devices problematic. Sensitivity-based regularization of neurons (SeReNe) is a method for learning sparse topologies with a structure, exploiting neural sensitivity as a regularizer. We define the sensitivity of a neuron as the variation of the network output with respect to the variation of the activity of the neuron. The lower the sensitivity of a neuron, the less the network output is perturbed if the neuron output changes. By including the neuron sensitivity in the cost function as a regularization term, we are able to prune neurons with low sensitivity. As entire neurons are pruned rather than single parameters, practical network footprint reduction becomes possible. Our experimental results on multiple network architectures and datasets yield competitive compression ratios with respect to state-of-the-art references.

Authors

  • Enzo Tartaglione
    Computer Science Department, University of Turin, 10149 Torino, Italy.
  • Andrea Bragagnolo
    Università degli Studi di Torino, corso Svizzera 185, Torino, Italy.
  • Francesco Odierna
  • Attilio Fiandrotti
    Università degli Studi di Torino, corso Svizzera 185, Torino, Italy.
  • Marco Grangetto
    Computer Science Department, University of Turin, 10149 Torino, Italy.