AIMC Topic: Reward

Clear Filters Showing 101 to 110 of 121 articles

Implicit Value Updating Explains Transitive Inference Performance: The Betasort Model.

PLoS computational biology
Transitive inference (the ability to infer that B > D given that B > C and C > D) is a widespread characteristic of serial learning, observed in dozens of species. Despite these robust behavioral effects, reinforcement learning models reliant on rewa...

A biological mechanism for Bayesian feature selection: Weight decay and raising the LASSO.

Neural networks : the official journal of the International Neural Network Society
Biological systems are capable of learning that certain stimuli are valuable while ignoring the many that are not, and thus perform feature selection. In machine learning, one effective feature selection approach is the least absolute shrinkage and s...

Rethinking exploration-exploitation trade-off in reinforcement learning via cognitive consistency.

Neural networks : the official journal of the International Neural Network Society
The exploration-exploitation dilemma is one of the fundamental challenges in deep reinforcement learning (RL). Agents must strike a trade-off between making decisions based on current beliefs or gathering more information. Prior work mostly prefers d...

HCPI-HRL: Human Causal Perception and Inference-driven Hierarchical Reinforcement Learning.

Neural networks : the official journal of the International Neural Network Society
The dependency on extensive expert knowledge for defining subgoals in hierarchical reinforcement learning (HRL) restricts the training efficiency and adaptability of HRL agents in complex, dynamic environments. Inspired by human-guided causal discove...

Episodic Memory-Double Actor-Critic Twin Delayed Deep Deterministic Policy Gradient.

Neural networks : the official journal of the International Neural Network Society
Existing deep reinforcement learning (DRL) algorithms suffer from the problem of low sample efficiency. Episodic memory allows DRL algorithms to remember and use past experiences with high return, thereby improving sample efficiency. However, due to ...

Reinforcement learning for automated method development in liquid chromatography: insights in the reward scheme and experimental budget selection.

Journal of chromatography. A
Chromatographic problem solving, commonly referred to as method development (MD), is hugely complex, given the many operational parameters that must be optimized and their large effect on the elution times of individual sample compounds. Recently, th...

Unmasking the Dark Triad: A Data Fusion Machine Learning Approach to Characterize the Neural Bases of Narcissistic, Machiavellian and Psychopathic Traits.

The European journal of neuroscience
The Dark Triad (DT), encompassing narcissism, Machiavellianism and psychopathy traits, poses significant societal challenges. Understanding the neural underpinnings of these traits is crucial for developing effective interventions and preventive stra...

Dynamic Inverse Reinforcement Learning for Feedback-driven Reward Estimation in Brain Machine Interface Tasks.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Reinforcement learning (RL)-based brain machine interfaces (BMIs) provide a promising solution for paralyzed people. Enhancing the decoding performance of RL-based BMIs relies on the design of effective reward signals. Inverse reinforcement learning ...

An approach to solving optimal control problems of nonlinear systems by introducing detail-reward mechanism in deep reinforcement learning.

Mathematical biosciences and engineering : MBE
In recent years, dynamic programming and reinforcement learning theory have been widely used to solve the nonlinear control system (NCS). Among them, many achievements have been made in the construction of network model and system stability analysis,...