AIMC Topic: Reward

Clear Filters Showing 11 to 20 of 118 articles

Offline reward shaping with scaling human preference feedback for deep reinforcement learning.

Neural networks : the official journal of the International Neural Network Society
Designing reward functions that fully align with human intent is often challenging. Preference-based Reinforcement Learning (PbRL) provides a framework where humans can select preferred segments through pairwise comparisons of behavior trajectory seg...

Highly valued subgoal generation for efficient goal-conditioned reinforcement learning.

Neural networks : the official journal of the International Neural Network Society
Goal-conditioned reinforcement learning is widely used in robot control, manipulating the robot to accomplish specific tasks by maximizing accumulated rewards. However, the useful reward signal is only received when the desired goal is reached, leadi...

Learning explainable task-relevant state representation for model-free deep reinforcement learning.

Neural networks : the official journal of the International Neural Network Society
State representations considerably accelerate learning speed and improve data efficiency for deep reinforcement learning (DRL), especially for visual tasks. Task-relevant state representations could focus on features relevant to the task, filter out ...

Exploring the effectiveness of reward-based learning strategies for second-language speech sounds.

Psychonomic bulletin & review
Adults struggle to learn non-native speech categories in many experimental settings (Goto, Neuropsychologia, 9(3), 317-323 1971), but learn efficiently in a video game paradigm where non-native speech sounds have functional significance (Lim & Holt, ...

Emergence of integrated behaviors through direct optimization for homeostasis.

Neural networks : the official journal of the International Neural Network Society
Homeostasis is a self-regulatory process, wherein an organism maintains a specific internal physiological state. Homeostatic reinforcement learning (RL) is a framework recently proposed in computational neuroscience to explain animal behavior. Homeos...

Salience Interest Option: Temporal abstraction with salience interest functions.

Neural networks : the official journal of the International Neural Network Society
Reinforcement Learning (RL) is a significant machine learning subfield that emphasizes learning actions based on environment to obtain optimal behavior policy. RL agents can make decisions at variable time scales in the form of temporal abstractions,...

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation.

Neural networks : the official journal of the International Neural Network Society
Improving generalization ability in multi-robot formation can reduce repetitive training and calculation. In this paper, we study the multi-robot formation problem with the ability to generalize the target position. Since the generalization ability o...

Anthropomorphism-based causal and responsibility attributions to robots.

Scientific reports
People tend to expect mental capabilities in a robot based on anthropomorphism and often attribute the cause and responsibility for a failure in human-robot interactions to the robot. This study investigated the relationship between mind perception, ...

Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.

IEEE transactions on neural networks and learning systems
Deep reinforcement learning (DRL) is a machine learning method based on rewards, which can be extended to solve some complex and realistic decision-making problems. Autonomous driving needs to deal with a variety of complex and changeable traffic sce...