Toward a Psychology of Deep Reinforcement Learning Agents Using a Cognitive Architecture.

Journal: Topics in cognitive science
Published Date:

Abstract

We argue that cognitive models can provide a common ground between human users and deep reinforcement learning (Deep RL) algorithms for purposes of explainable artificial intelligence (AI). Casting both the human and learner as cognitive models provides common mechanisms to compare and understand their underlying decision-making processes. This common grounding allows us to identify divergences and explain the learner's behavior in human understandable terms. We present novel salience techniques that highlight the most relevant features in each model's decision-making, as well as examples of this technique in common training environments such as Starcraft II and an OpenAI gridworld.

Authors

  • Konstantinos Mitsopoulos
    Psychology Department, Carnegie Mellon University.
  • Sterling Somers
    Psychology Department, Carnegie Mellon University.
  • Joel Schooler
    Institute for Human and Machine Cognition, Pensacola.
  • Christian Lebiere
    Psychology Department, Carnegie Mellon University.
  • Peter Pirolli
    Institute for Human and Machine Cognition, Pensacola.
  • Robert Thomson
    Psychology Department, Carnegie Mellon University.