AIMC Topic: Reward

Clear Filters Showing 31 to 40 of 121 articles

Goals, usefulness and abstraction in value-based choice.

Trends in cognitive sciences
Colombian drug lord Pablo Escobar, while on the run, purportedly burned two million dollars in banknotes to keep his daughter warm. A stark reminder that, in life, circumstances and goals can quickly change, forcing us to reassess and modify our valu...

Robust Inverse Q-Learning for Continuous-Time Linear Systems in Adversarial Environments.

IEEE transactions on cybernetics
This article proposes robust inverse Q -learning algorithms for a learner to mimic an expert's states and control inputs in the imitation learning problem. These two agents have different adversarial disturbances. To do the imitation, the learner mus...

Deep Reinforcement Learning for the Detection of Abnormal Data in Smart Meters.

Sensors (Basel, Switzerland)
The rapidly growing power data in smart grids have created difficulties in security management. The processing of large-scale power data with the use of artificial intelligence methods has become a hotspot research topic. Considering the early warnin...

Orientation-Preserving Rewards' Balancing in Reinforcement Learning.

IEEE transactions on neural networks and learning systems
Auxiliary rewards are widely used in complex reinforcement learning tasks. However, previous work can hardly avoid the interference of auxiliary rewards on pursuing the main rewards, which leads to the destruction of the optimal policy. Thus, it is c...

The Influence of Robots' Fairness on Humans' Reward-Punishment Behaviors and Trust in Human-Robot Cooperative Teams.

Human factors
OBJECTIVE: Based on social exchange theory, this study investigates the effects of robots' fairness and social status on humans' reward-punishment behaviors and trust in human-robot interactions.

Socially situated artificial intelligence enables learning from human interaction.

Proceedings of the National Academy of Sciences of the United States of America
Regardless of how much data artificial intelligence agents have available, agents will inevitably encounter previously unseen situations in real-world deployments. Reacting to novel situations by acquiring new information from other people-socially s...

Flexible control as surrogate reward or dynamic reward maximization.

Cognition
The utility of a given experience, like interacting with a particular friend or tasting a particular food, fluctuates continually according to homeostatic and hedonic principles. Consequently, to maximize reward, an individual must be able to escape ...

Value-free random exploration is linked to impulsivity.

Nature communications
Deciding whether to forgo a good choice in favour of exploring a potentially more rewarding alternative is one of the most challenging arbitrations both in human reasoning and in artificial intelligence. Humans show substantial variability in their e...

Application of Deep Reinforcement Learning to NS-SHAFT Game Signal Control.

Sensors (Basel, Switzerland)
Reinforcement learning (RL) with both exploration and exploit abilities is applied to games to demonstrate that it can surpass human performance. This paper mainly applies Deep Q-Network (DQN), which combines reinforcement learning and deep learning ...

Generalized Single-Vehicle-Based Graph Reinforcement Learning for Decision-Making in Autonomous Driving.

Sensors (Basel, Switzerland)
In the autonomous driving process, the decision-making system is mainly used to provide macro-control instructions based on the information captured by the sensing system. Learning-based algorithms have apparent advantages in information processing a...