AIMC Topic: Reward

Clear Filters Showing 31 to 40 of 118 articles

Orientation-Preserving Rewards' Balancing in Reinforcement Learning.

IEEE transactions on neural networks and learning systems
Auxiliary rewards are widely used in complex reinforcement learning tasks. However, previous work can hardly avoid the interference of auxiliary rewards on pursuing the main rewards, which leads to the destruction of the optimal policy. Thus, it is c...

The Influence of Robots' Fairness on Humans' Reward-Punishment Behaviors and Trust in Human-Robot Cooperative Teams.

Human factors
OBJECTIVE: Based on social exchange theory, this study investigates the effects of robots' fairness and social status on humans' reward-punishment behaviors and trust in human-robot interactions.

Socially situated artificial intelligence enables learning from human interaction.

Proceedings of the National Academy of Sciences of the United States of America
Regardless of how much data artificial intelligence agents have available, agents will inevitably encounter previously unseen situations in real-world deployments. Reacting to novel situations by acquiring new information from other people-socially s...

Flexible control as surrogate reward or dynamic reward maximization.

Cognition
The utility of a given experience, like interacting with a particular friend or tasting a particular food, fluctuates continually according to homeostatic and hedonic principles. Consequently, to maximize reward, an individual must be able to escape ...

Value-free random exploration is linked to impulsivity.

Nature communications
Deciding whether to forgo a good choice in favour of exploring a potentially more rewarding alternative is one of the most challenging arbitrations both in human reasoning and in artificial intelligence. Humans show substantial variability in their e...

Application of Deep Reinforcement Learning to NS-SHAFT Game Signal Control.

Sensors (Basel, Switzerland)
Reinforcement learning (RL) with both exploration and exploit abilities is applied to games to demonstrate that it can surpass human performance. This paper mainly applies Deep Q-Network (DQN), which combines reinforcement learning and deep learning ...

Generalized Single-Vehicle-Based Graph Reinforcement Learning for Decision-Making in Autonomous Driving.

Sensors (Basel, Switzerland)
In the autonomous driving process, the decision-making system is mainly used to provide macro-control instructions based on the information captured by the sensing system. Learning-based algorithms have apparent advantages in information processing a...

Sharing Rewards Undermines Coordinated Hunting.

Journal of computational biology : a journal of computational molecular cell biology
Coordinated hunting is widely observed in animals, and sharing rewards is often considered a major incentive for its success. While current theories about the role played by sharing in coordinated hunting are based on correlational evidence, we revea...

Exploring Potential Energy Surfaces Using Reinforcement Machine Learning.

Journal of chemical information and modeling
Reinforcement machine learning is implemented to survey a series of model potential energy surfaces and ultimately identify the global minima point. Through sophisticated reward function design, the introduction of an optimizing target, and incorpora...

Deep Reinforcement Learning With Modulated Hebbian Plus Q-Network Architecture.

IEEE transactions on neural networks and learning systems
In this article, we consider a subclass of partially observable Markov decision process (POMDP) problems which we termed confounding POMDPs. In these types of POMDPs, temporal difference (TD)-based reinforcement learning (RL) algorithms struggle, as ...