Vector-based navigation using grid-like representations in artificial agents.

Journal: Nature
PMID:

Abstract

Deep neural networks have achieved impressive successes in fields ranging from object recognition to complex games such as Go. Navigation, however, remains a substantial challenge for artificial agents, with deep neural networks trained by reinforcement learning failing to rival the proficiency of mammalian spatial behaviour, which is underpinned by grid cells in the entorhinal cortex . Grid cells are thought to provide a multi-scale periodic representation that functions as a metric for coding space and is critical for integrating self-motion (path integration) and planning direct trajectories to goals (vector-based navigation). Here we set out to leverage the computational functions of grid cells to develop a deep reinforcement learning agent with mammal-like navigational abilities. We first trained a recurrent network to perform path integration, leading to the emergence of representations resembling grid cells, as well as other entorhinal cell types . We then showed that this representation provided an effective basis for an agent to locate goals in challenging, unfamiliar, and changeable environments-optimizing the primary objective of navigation through deep reinforcement learning. The performance of agents endowed with grid-like representations surpassed that of an expert human and comparison agents, with the metric quantities necessary for vector-based navigation derived from grid-like units within the network. Furthermore, grid-like representations enabled agents to conduct shortcut behaviours reminiscent of those performed by mammals. Our findings show that emergent grid-like representations furnish agents with a Euclidean spatial metric and associated vector operations, providing a foundation for proficient navigation. As such, our results support neuroscientific theories that see grid cells as critical for vector-based navigation, demonstrating that the latter can be combined with path-based strategies to support navigation in challenging environments.

Authors

  • Andrea Banino
    DeepMind, London, UK. abanino@google.com.
  • Caswell Barry
    UCL Department of Cell and Developmental Biology, Gower Street, London, WC1E 6BT, UK.
  • Benigno Uria
    DeepMind, London, UK.
  • Charles Blundell
    DeepMind, London, UK.
  • Timothy Lillicrap
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Piotr Mirowski
    DeepMind, London, UK.
  • Alexander Pritzel
    DeepMind, London, UK.
  • Martin J Chadwick
    DeepMind, London, UK.
  • Thomas Degris
    DeepMind, London, UK.
  • Joseph Modayil
    DeepMind, London, UK.
  • Greg Wayne
    DeepMind,London N1 9DR,UK.gregwayne@gmail.com.
  • Hubert Soyer
    DeepMind, London, UK.
  • Fabio Viola
    DeepMind, London, UK.
  • Brian Zhang
    DeepMind, London, UK.
  • Ross Goroshin
    DeepMind, London, UK.
  • Neil Rabinowitz
    DeepMind, London EC4 5TW, United Kingdom.
  • Razvan Pascanu
    DeepMind, London EC4 5TW, United Kingdom.
  • Charlie Beattie
    DeepMind, London, UK.
  • Stig Petersen
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Amir Sadik
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Stephen Gaffney
    DeepMind, London, UK.
  • Helen King
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Koray Kavukcuoglu
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Demis Hassabis
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Raia Hadsell
    DeepMind, London EC4 5TW, United Kingdom.
  • Dharshan Kumaran
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.