Neural scene representation and rendering.

Journal: Science (New York, N.Y.)
Published Date:

Abstract

Scene representation, the process of converting visual sensory data into concise descriptions, is a requirement for intelligent behavior. Recent work has shown that neural networks excel at this task when provided with large, labeled datasets. However, removing the reliance on human labeling remains an important open problem. To this end, we introduce the Generative Query Network (GQN), a framework within which machines learn to represent scenes using only their own sensors. The GQN takes as input images of a scene taken from different viewpoints, constructs an internal representation, and uses this representation to predict the appearance of that scene from previously unobserved viewpoints. The GQN demonstrates representation learning without human labels or domain knowledge, paving the way toward machines that autonomously learn to understand the world around them.
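
The data flow described in the abstract (observe a scene from several viewpoints, build one internal representation, query it from a new viewpoint) can be sketched roughly as below. This is an illustrative toy, not the published model: the abstract does not specify the architecture, so the layer sizes, the untrained random weights, the summation used to combine observations, the deterministic decoder, and all function names are assumptions made for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy sizes; none of these come from the abstract.
IMG_SHAPE = (8, 8, 3)
IMG_DIM = int(np.prod(IMG_SHAPE))
VIEW_DIM = 7
REPR_DIM = 16

# Untrained random weights standing in for learned networks.
W_enc = rng.normal(scale=0.1, size=(IMG_DIM + VIEW_DIM, REPR_DIM))
W_dec = rng.normal(scale=0.1, size=(REPR_DIM + VIEW_DIM, IMG_DIM))


def encode_observation(image, viewpoint):
    """Map one (image, viewpoint) pair to a representation vector."""
    x = np.concatenate([image.ravel(), viewpoint])
    return np.tanh(x @ W_enc)


def represent_scene(images, viewpoints):
    """Combine per-observation vectors into one scene representation.

    Summation is an assumption of this sketch; it makes the result
    independent of the order in which observations arrive.
    """
    parts = (encode_observation(i, v) for i, v in zip(images, viewpoints))
    return sum(parts, np.zeros(REPR_DIM))


def predict_view(scene_repr, query_viewpoint):
    """Predict an image for a viewpoint that was not observed."""
    h = np.concatenate([scene_repr, query_viewpoint])
    pixels = 1.0 / (1.0 + np.exp(-(h @ W_dec)))  # squash to [0, 1]
    return pixels.reshape(IMG_SHAPE)


# Three observations of one toy scene, then a query from a new viewpoint.
images = rng.random((3, *IMG_SHAPE))
viewpoints = rng.random((3, VIEW_DIM))
scene_repr = represent_scene(images, viewpoints)
prediction = predict_view(scene_repr, rng.random(VIEW_DIM))
```

In a trained system the encoder and decoder weights would be fitted so that the predicted image matches the true image at the query viewpoint; no human-provided labels enter that objective, which is the point the abstract emphasizes.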

Authors

  • S M Ali Eslami
    DeepMind, 5 New Street Square, London EC4A 3TW, UK. aeslami@google.com.
  • Danilo Jimenez Rezende
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Frederic Besse
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Fabio Viola
    DeepMind, London, UK.
  • Ari S Morcos
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Marta Garnelo
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Avraham Ruderman
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Andrei A Rusu
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Ivo Danihelka
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Karol Gregor
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • David P Reichert
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Lars Buesing
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Theophane Weber
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Oriol Vinyals
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Dan Rosenbaum
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Neil Rabinowitz
    DeepMind, London EC4 5TW, United Kingdom.
  • Helen King
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Chloe Hillier
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Matt Botvinick
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Daan Wierstra
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Koray Kavukcuoglu
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Demis Hassabis
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.