Deep convolutional models improve predictions of macaque V1 responses to natural images.

Journal: PLoS computational biology
Published Date:

Abstract

Despite great efforts over several decades, our best models of primary visual cortex (V1) still predict spiking activity quite poorly when probed with natural stimuli, highlighting our limited understanding of the nonlinear computations in V1. Recently, two approaches based on deep learning have emerged for modeling these nonlinear computations: transfer learning from artificial neural networks trained on object recognition and data-driven convolutional neural network models trained end-to-end on large populations of neurons. Here, we test the ability of both approaches to predict spiking activity in response to natural images in V1 of awake monkeys. We found that the transfer learning approach performed similarly well to the data-driven approach and both outperformed classical linear-nonlinear and wavelet-based feature representations that build on existing theories of V1. Notably, transfer learning using a pre-trained feature space required substantially less experimental time to achieve the same performance. In conclusion, multi-layer convolutional neural networks (CNNs) set the new state of the art for predicting neural responses to natural images in primate V1 and deep features learned for object recognition are better explanations for V1 computation than all previous filter bank theories. This finding strengthens the necessity of V1 models that are multiple nonlinearities away from the image domain and it supports the idea of explaining early visual cortex based on high-level functional goals.

Authors

  • Santiago A Cadena
    Centre for Integrative Neuroscience and Institute for Theoretical Physics, University of Tübingen, Tübingen, Germany.
  • George H Denfield
    Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America.
  • Edgar Y Walker
    Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, Texas, United States of America.
  • Leon A Gatys
    Werner Reichardt Centre for Integrative Neuroscience and Institute of Theoretical Physics, University of Tübingen, Germany; Bernstein Center for Computational Neuroscience, Tübingen, Germany; Graduate School for Neural Information Processing, Tübingen, Germany.
  • Andreas S Tolias
    Bernstein Center for Computational Neuroscience, Tübingen, Germany.
  • Matthias Bethge
    University of Tübingen, Tübingen, Germany.
  • Alexander S Ecker
    Werner Reichardt Centre for Integrative Neuroscience and Institute of Theoretical Physics, University of Tübingen, Germany; Bernstein Center for Computational Neuroscience, Tübingen, Germany; Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA.