Human adaptation to adaptive machines converges to game-theoretic equilibria.

Journal: Scientific reports
Published Date:

Abstract

Here we test three learning algorithms for machines playing general-sum games with human subjects. The algorithms enable the machine to select the outcome of the co-adaptive interaction from a constellation of game-theoretic equilibria in action and policy spaces. Importantly, the machine learning algorithms work directly from observations of human actions without solving an inverse problem to estimate the human's utility function as in prior work. Surprisingly, one algorithm can steer the human-machine interaction to the machine's optimum, effectively controlling the human's actions even while the human responds optimally to their perceived cost landscape. Our results show that game theory can be used to predict and design outcomes of co-adaptive interactions between intelligent humans and machines.

Authors

  • Benjamin J Chasnov
    Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, 98195, USA.
  • Lillian J Ratliff
    Department of Electrical & Computer Engineering, University of Washington, Seattle, WA, 98195, USA.
  • Samuel A Burden
    Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195, USA.