AIMC Topic: Reward

Clear Filters Showing 101 to 110 of 129 articles

Model-based reinforcement learning with dimension reduction.

Neural networks : the official journal of the International Neural Network Society
The goal of reinforcement learning is to learn an optimal policy which controls an agent to acquire the maximum cumulative reward. The model-based reinforcement learning approach learns a transition model of the environment from data, and then derive...

Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity.

Frontiers in neural circuits
The brain enables animals to behaviorally adapt in order to survive in a complex and dynamic environment, but how reward-oriented behaviors are achieved and computed by its underlying neural circuitry is an open question. To address this concern, we ...

Non-reward neural mechanisms in the orbitofrontal cortex.

Cortex; a journal devoted to the study of the nervous system and behavior
Single neurons in the primate orbitofrontal cortex respond when an expected reward is not obtained, and behaviour must change. The human lateral orbitofrontal cortex is activated when non-reward, or loss occurs. The neuronal computation of this negat...

A neural model of the frontal eye fields with reward-based learning.

Neural networks : the official journal of the International Neural Network Society
Decision-making is a flexible process dependent on the accumulation of various kinds of information; however, the corresponding neural mechanisms are far from clear. We extended a layered model of the frontal eye field to a learning-based model, usin...

A Fly-Inspired Mushroom Bodies Model for Sensory-Motor Control Through Sequence and Subsequence Learning.

International journal of neural systems
Classification and sequence learning are relevant capabilities used by living beings to extract complex information from the environment for behavioral control. The insect world is full of examples where the presentation time of specific stimuli shap...

Demixed principal component analysis of neural population data.

eLife
Neurons in higher cortical areas, such as the prefrontal cortex, are often tuned to a variety of sensory and motor variables, and are therefore said to display mixed selectivity. This complexity of single neuron responses can obscure what information...

Learning to Produce Syllabic Speech Sounds via Reward-Modulated Neural Plasticity.

PloS one
At around 7 months of age, human infants begin to reliably produce well-formed syllables containing both consonants and vowels, a behavior called canonical babbling. Over subsequent months, the frequency of canonical babbling continues to increase. H...

Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model.

PLoS computational biology
Transitive inference (the ability to infer that B > D given that B > C and C > D) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on rewa...

A biological mechanism for Bayesian feature selection: Weight decay and raising the LASSO.

Neural networks : the official journal of the International Neural Network Society
Biological systems are capable of learning that certain stimuli are valuable while ignoring the many that are not, and thus perform feature selection. In machine learning, one effective feature selection approach is the least absolute shrinkage and s...