Human-level play in the game of Diplomacy by combining language models with strategic reasoning.

Journal: Science (New York, N.Y.)
PMID:

Abstract

Despite much progress in training artificial intelligence (AI) systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce Cicero, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both cooperation and competition that emphasizes natural language negotiation and tactical coordination between seven players. Cicero integrates a language model with planning and reinforcement learning algorithms by inferring players' beliefs and intentions from its conversations and generating dialogue in pursuit of its plans. Across 40 games of an anonymous online league, Cicero achieved more than double the average score of the human players and ranked in the top 10% of participants who played more than one game.

Authors

  • Anton Bakhtin
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Noam Brown
    Computer Science Department, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA 15213, USA.
  • Emily Dinan
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Gabriele Farina
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Colin Flaherty
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Daniel Fried
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Andrew Goff
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Jonathan Gray
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Hengyuan Hu
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Athul Paul Jacob
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Mojtaba Komeili
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Karthik Konath
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Minae Kwon
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Adam Lerer
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Mike Lewis
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Alexander H Miller
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Sasha Mitts
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Adithya Renduchintala
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Stephen Roller
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Dirk Rowe
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Weiyan Shi
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Joe Spisak
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Alexander Wei
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • David Wu
    School of Nursing & Health Professions, Georgia State University, Atlanta, GA.
  • Hugh Zhang
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.
  • Markus Zijlstra
    Meta AI, 1 Hacker Way, Menlo Park, CA, USA.