Reinforcement Learning: Full Glass or Empty - Depends Who You Ask.

Journal: Current biology : CB
PMID:

Abstract

An extension of the prediction error theory of dopamine, imported from artificial intelligence, represents the full distribution over future rewards rather than only the average and better explains dopamine responses.
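
The contrast between tracking only the average and tracking the full distribution can be illustrated with a toy sketch. It is not the authors' model. It assumes one common formulation of distributional learning, in which each unit scales positive and negative prediction errors by different learning rates. The function name learn_value, the reward list, and all learning rates are hypothetical and chosen only for illustration.

```python
def learn_value(rewards, alpha_pos, alpha_neg, epochs=500):
    """Learn a value estimate from repeated reward samples.

    alpha_pos scales positive prediction errors and alpha_neg scales
    negative ones (illustrative assumption, not taken from the source).
    """
    value = 0.0
    for _ in range(epochs):
        for reward in rewards:
            delta = reward - value  # reward prediction error
            rate = alpha_pos if delta > 0 else alpha_neg
            value += rate * delta
    return value


# Hypothetical skewed reward distribution: usually nothing, sometimes a large reward.
rewards = [0.0, 0.0, 0.0, 10.0]

# Symmetric rates: the classical case, which settles near the mean reward.
average = learn_value(rewards, alpha_pos=0.01, alpha_neg=0.01)

# Asymmetric rates: an "optimistic" unit weights positive errors more,
# a "pessimistic" unit weights negative errors more.
optimist = learn_value(rewards, alpha_pos=0.02, alpha_neg=0.005)
pessimist = learn_value(rewards, alpha_pos=0.005, alpha_neg=0.02)

print("pessimist:", round(pessimist, 2))
print("average:  ", round(average, 2))
print("optimist: ", round(optimist, 2))
```

Under these assumptions the symmetric unit carries a single number close to the mean, while the optimistic and pessimistic units settle above and below it. Taken together, a population of such units with varied asymmetries carries information about the spread and shape of the reward distribution, not just its average.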

Authors

  • Jacob J W Bakermans
    Wellcome Centre for Integrative Neuroimaging, FMRIB, Nuffield Department of Clinical Neurosciences, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK. Electronic address: Jacob.bakermans@ndcn.ox.ac.uk.
  • Timothy H Muller
    Institute of Neurology, Department of Clinical and Movement Neurosciences, University College London, London WC1N 3BG, UK. Electronic address: timothymuller127@gmail.com.
  • Timothy E J Behrens
    Wellcome Centre for Integrative Neuroimaging, FMRIB, Nuffield Department of Clinical Neurosciences, University of Oxford, John Radcliffe Hospital, Oxford OX3 9DU, UK; Wellcome Centre for Human Neuroimaging, University College London, London WC1N 3AR, UK. Electronic address: behrens@fmrib.ox.ac.uk.