Learning agile soccer skills for a bipedal robot with deep reinforcement learning.

Journal: Science robotics
Published Date:

Abstract

We investigated whether deep reinforcement learning (deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies. We used deep RL to train a humanoid robot to play a simplified one-versus-one soccer game. The resulting agent exhibits robust and dynamic movement skills, such as rapid fall recovery, walking, turning, and kicking, and it transitions between them in a smooth and efficient manner. It also learned to anticipate ball movements and block opponent shots. The agent's tactical behavior adapts to specific game contexts in a way that would be impractical to manually design. Our agent was trained in simulation and transferred to real robots zero-shot. A combination of sufficiently high-frequency control, targeted dynamics randomization, and perturbations during training enabled good-quality transfer. In experiments, the agent walked 181% faster, turned 302% faster, took 63% less time to get up, and kicked a ball 34% faster than a scripted baseline.

Authors

  • Tuomas Haarnoja
    Google DeepMind, London, UK.
  • Ben Moran
    Google DeepMind, London, UK.
  • Guy Lever
    DeepMind, London, UK.
  • Sandy H Huang
    Google DeepMind, London, UK.
  • Dhruva Tirumala
    Google DeepMind, London, UK.
  • Jan Humplik
    Google DeepMind, London, UK.
  • Markus Wulfmeier
    Google DeepMind, London, UK.
  • Saran Tunyasuvunakool
    Google DeepMind, London, UK.
  • Noah Y Siegel
    Google DeepMind, London, UK.
  • Roland Hafner
    Google DeepMind, London, UK.
  • Michael Bloesch
    Google DeepMind, London, UK.
  • Kristian Hartikainen
    Google DeepMind, London, UK.
  • Arunkumar Byravan
    Google DeepMind, London, UK.
  • Leonard Hasenclever
    Google DeepMind, London, UK.
  • Yuval Tassa
    Google DeepMind, London, UK.
  • Fereshteh Sadeghi
    Google DeepMind, London, UK.
  • Nathan Batchelor
    Google DeepMind, London, UK.
  • Federico Casarini
    Google DeepMind, London, UK.
  • Stefano Saliceti
    Google DeepMind, London, UK.
  • Charles Game
    Google DeepMind, London, UK.
  • Neil Sreendra
    Google DeepMind, London, UK.
  • Kushal Patel
    Department of Radiology, Icahn School of Medicine at Mount Sinai, 1 Gustave L. Levy Place, New York, NY 10029, USA. Electronic address: kushal.patel@mountsinai.org.
  • Marlon Gwira
    Google DeepMind, London, UK.
  • Andrea Huber
    Google DeepMind, London, UK.
  • Nicole Hurley
    Google DeepMind, London, UK.
  • Francesco Nori
    Google DeepMind, London, UK.
  • Raia Hadsell
    DeepMind, London EC4 5TW, United Kingdom.
  • Nicolas Heess
    Google DeepMind, London, UK.