AIMC Topic: Reward

Clear Filters Showing 41 to 50 of 118 articles

Biased Pressure: Cyclic Reinforcement Learning Model for Intelligent Traffic Signal Control.

Sensors (Basel, Switzerland)
Existing inefficient traffic signal plans are causing traffic congestions in many urban areas. In recent years, many deep reinforcement learning (RL) methods have been proposed to control traffic signals in real-time by interacting with the environme...

Semicentralized Deep Deterministic Policy Gradient in Cooperative StarCraft Games.

IEEE transactions on neural networks and learning systems
In this article, we propose a novel semicentralized deep deterministic policy gradient (SCDDPG) algorithm for cooperative multiagent games. Specifically, we design a two-level actor-critic structure to help the agents with interactions and cooperatio...

Exploration in neo-Hebbian reinforcement learning: Computational approaches to the exploration-exploitation balance with bio-inspired neural networks.

Neural networks : the official journal of the International Neural Network Society
Recent theoretical and experimental works have connected Hebbian plasticity with the reinforcement learning (RL) paradigm, producing a class of trial-and-error learning in artificial neural networks known as neo-Hebbian plasticity. Inspired by the ro...

Using deep learning to predict human decisions and using cognitive models to explain deep learning models.

Scientific reports
Deep neural networks (DNNs) models have the potential to provide new insights in the study of cognitive processes, such as human decision making, due to their high capacity and data-driven design. While these models may be able to go beyond theory-dr...

A deep reinforcement transfer convolutional neural network for rolling bearing fault diagnosis.

ISA transactions
Deep neural networks highly depend on substantial labeled samples when identifying bearing fault. However, in some practical situations, it is very difficult to collect sufficient labeled samples, which limits the application of deep neural networks ...

Outracing champion Gran Turismo drivers with deep reinforcement learning.

Nature
Many potential applications of artificial intelligence involve making real-time decisions in physical systems while interacting with humans. Automobile racing represents an extreme example of these conditions; drivers must execute complex tactical ma...

Toward the biological model of the hippocampus as the successor representation agent.

Bio Systems
The hippocampus is an essential brain region for spatial memory and learning. Recently, a theoretical model of the hippocampus based on temporal difference (TD) learning has been published. Inspired by the successor representation (SR) learning algor...

Meta-analysis of human prediction error for incentives, perception, cognition, and action.

Neuropsychopharmacology : official publication of the American College of Neuropsychopharmacology
Prediction errors (PEs) are a keystone for computational neuroscience. Their association with midbrain neural firing has been confirmed across species and has inspired the construction of artificial intelligence that can outperform humans. However, t...

Underwater gliders linear trajectory tracking: The experience breeding actor-critic approach.

ISA transactions
This paper studies the underwater glider trajectory tracking in currents field. The objective is to ensure that trajectories fit to the straight target track. The underwater glider model is introduced to demonstrate the vehicle dynamic properties. Co...

Exploring the Black Box of Managing Total Rewards for Older Professionals in the Canadian Financial Services Sector.

Canadian journal on aging = La revue canadienne du vieillissement
This study extends our knowledge about the management of older employees in the sector of financial services, which faces enormous transformational pressures (e.g., emergence of artificial intelligence, digital services). Based on the black box model...