Human-level performance in 3D multiplayer games with population-based reinforcement learning.

Journal: Science (New York, N.Y.)

PMID: 31147514

Abstract

Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents is trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research.

Authors

Max Jaderberg

DeepMind, London, UK. lejlot@google.com jaderberg@google.com.

Wojciech M Czarnecki

Faculty of Mathematics and Computer Science, Jagiellonian University, Lojasiewicza 6, 30-348 Krakow, Poland. wojciech.czarnecki@uj.edu.pl.

Iain Dunning

DeepMind, London, UK.

Luke Marris

DeepMind, London, UK.

Guy Lever

DeepMind, London, UK.

Antonio Garcia Castañeda

DeepMind, London, UK.

Charles Beattie

Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Neil C Rabinowitz

DeepMind, London, UK.

Ari S Morcos

DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Avraham Ruderman

DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Nicolas Sonnerat

DeepMind, London, UK.

Tim Green

DeepMind, London, UK.

Louise Deason

DeepMind, London, UK.

Joel Z Leibo

DeepMind, London, UK.

David Silver

Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Demis Hassabis

Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Koray Kavukcuoglu

Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Thore Graepel

Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.

Keywords

Machine Learning; Reinforcement, Psychology; Reward; Video Games
