AIMC Topic: Reinforcement, Psychology

Clear Filters Showing 81 to 90 of 293 articles

A Human-Machine Agent Based on Active Reinforcement Learning for Target Classification in Wargame.

IEEE transactions on neural networks and learning systems
To meet the requirements of high accuracy and low cost of target classification in modern warfare, and lay the foundation for target threat assessment, the article proposes a human-machine agent for target classification based on active reinforcement...

Towards biologically plausible model-based reinforcement learning in recurrent spiking networks by dreaming new experiences.

Scientific reports
Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing th...

GeneWorker: An end-to-end robotic reinforcement learning approach with collaborative generator and worker networks.

Neural networks : the official journal of the International Neural Network Society
Reinforcement learning aided by the skill conception exhibits potent capabilities in guiding autonomous agents toward acquiring meaningful behaviors. However, in the current landscape of reinforcement learning, a skill is often merely a rudimentary a...

A recurrent network model of planning explains hippocampal replay and human behavior.

Nature neuroscience
When faced with a novel situation, people often spend substantial periods of time contemplating possible futures. For such planning to be rational, the benefits to behavior must compensate for the time spent thinking. Here, we capture these features ...

A Dynamic Window Method Based on Reinforcement Learning for SSVEP Recognition.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society
Steady-state visual evoked potential (SSVEP) is one of the most used brain-computer interface (BCI) paradigms. Conventional methods analyze SSVEPs at a fixed window length. Compared with these methods, dynamic window methods can achieve a higher info...

Systematic literature review on reinforcement learning in non-communicable disease interventions.

Artificial intelligence in medicine
There is evidence that reducing modifiable risk factors and strengthening medical and health interventions can reduce early mortality and economic losses from non-communicable diseases (NCDs). Machine learning (ML) algorithms have been successfully a...

Emergence of integrated behaviors through direct optimization for homeostasis.

Neural networks : the official journal of the International Neural Network Society
Homeostasis is a self-regulatory process, wherein an organism maintains a specific internal physiological state. Homeostatic reinforcement learning (RL) is a framework recently proposed in computational neuroscience to explain animal behavior. Homeos...

Collaborative hunting in artificial agents with deep reinforcement learning.

eLife
Collaborative hunting, in which predators play different and complementary roles to capture prey, has been traditionally believed to be an advanced hunting strategy requiring large brains that involve high-level cognition. However, recent findings th...

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation.

Neural networks : the official journal of the International Neural Network Society
As a special type of multi-objective combinatorial optimization problems (MOCOPs), the multi-objective traveling salesman problem (MOTSP) plays an important role in practical fields such as transportation and robot control. However, due to the comple...

Recurrent neural networks that learn multi-step visual routines with reinforcement learning.

PLoS computational biology
Many cognitive problems can be decomposed into series of subproblems that are solved sequentially by the brain. When subproblems are solved, relevant intermediate results need to be stored by neurons and propagated to the next subproblem, until the o...