Outracing champion Gran Turismo drivers with deep reinforcement learning.

Journal: Nature
PMID:

Abstract

Many potential applications of artificial intelligence involve making real-time decisions in physical systems while interacting with humans. Automobile racing represents an extreme example of these conditions; drivers must execute complex tactical manoeuvres to pass or block opponents while operating their vehicles at their traction limits. Racing simulations, such as the PlayStation game Gran Turismo, faithfully reproduce the non-linear control challenges of real race cars while also encapsulating the complex multi-agent interactions. Here we describe how we trained agents for Gran Turismo that can compete with the world's best e-sports drivers. We combine state-of-the-art, model-free, deep reinforcement learning algorithms with mixed-scenario training to learn an integrated control policy that combines exceptional speed with impressive tactics. In addition, we construct a reward function that enables the agent to be competitive while adhering to racing's important, but under-specified, sportsmanship rules. We demonstrate the capabilities of our agent, Gran Turismo Sophy, by winning a head-to-head competition against four of the world's best Gran Turismo drivers. By describing how we trained championship-level racers, we demonstrate the possibilities and challenges of using these techniques to control complex dynamical systems in domains where agents must respect imprecisely defined human norms.

Authors

  • Peter R Wurman
    Sony AI, New York, NY, USA. peter.wurman@sony.com.
  • Samuel Barrett
    Sony AI, New York, NY, USA.
  • Kenta Kawamoto
    Sony AI, Tokyo, Japan.
  • James MacGlashan
    Sony AI, New York, NY, USA.
  • Kaushik Subramanian
    Sony AI, Zürich, Switzerland.
  • Thomas J Walsh
    Sony AI, New York, NY, USA.
  • Roberto Capobianco
    Sony AI, Zürich, Switzerland.
  • Alisa Devlic
    Sony AI, Zürich, Switzerland.
  • Franziska Eckert
    Sony AI, Zürich, Switzerland.
  • Florian Fuchs
    Sony AI, Zürich, Switzerland.
  • Leilani Gilpin
    Sony AI, New York, NY, USA.
  • Piyush Khandelwal
    Sony AI, New York, NY, USA.
  • Varun Kompella
    Sony AI, New York, NY, USA.
  • HaoChih Lin
    Sony AI, Zürich, Switzerland.
  • Patrick MacAlpine
    Sony AI, New York, NY, USA.
  • Declan Oller
    Sony AI, New York, NY, USA.
  • Takuma Seno
    Sony AI, Tokyo, Japan.
  • Craig Sherstan
    Division of Physical Medicine & Rehabilitation, University of Alberta, Edmonton, AB, Canada Department of Computing Science, University of Alberta, Edmonton, AB, Canada.
  • Michael D Thomure
    Sony AI, New York, NY, USA.
  • Houmehr Aghabozorgi
    Sony AI, New York, NY, USA.
  • Leon Barrett
    Sony AI, New York, NY, USA.
  • Rory Douglas
    Sony AI, New York, NY, USA.
  • Dion Whitehead
    Sony AI, New York, NY, USA.
  • Peter Dürr
    Sony AI, Zürich, Switzerland.
  • Peter Stone
  • Michael Spranger
    Sony AI, Tokyo, Japan.
  • Hiroaki Kitano
    Systems Biology Institute, Tokyo, Japan.