AIMC Topic: Reinforcement, Psychology

Clear Filters Showing 21 to 30 of 264 articles

Incremental model-based reinforcement learning with model constraint.

Neural networks : the official journal of the International Neural Network Society
In model-based reinforcement learning (RL) approaches, the estimated model of a real environment is learned with limited data and then utilized for policy optimization. As a result, the policy optimization process in model-based RL is influenced by b...

Aligning large language models with radiologists by reinforcement learning from AI feedback for chest CT reports.

European journal of radiology
BACKGROUND: Large language models (LLMs) often struggle to fully capture the nuanced preferences and clinical judgement of radiologists in medical report summarization even when fine-tuned on massive medical reports. This could lead to the generated ...

Sample-efficient and occlusion-robust reinforcement learning for robotic manipulation via multimodal fusion dualization and representation normalization.

Neural networks : the official journal of the International Neural Network Society
Recent advances in visual reinforcement learning (visual RL), which learns from high-dimensional image observations, have narrowed the gap between state-based and image-based training. However, visual RL continues to face significant challenges in ro...

A multi-agent reinforcement learning framework for cross-domain sequential recommendation.

Neural networks : the official journal of the International Neural Network Society
Sequential recommendation models aim to predict the next item based on the sequence of items users interact with, ordered chronologically. However, these models face the challenge of data sparsity. Recent studies have explored cross-domain sequential...

A rule- and query-guided reinforcement learning for extrapolation reasoning in temporal knowledge graphs.

Neural networks : the official journal of the International Neural Network Society
Extrapolation reasoning in temporal knowledge graphs (TKGs) aims at predicting future facts based on historical data, and finds extensive application in diverse real-world scenarios. Existing TKG reasoning methods primarily focus on capturing the fac...

Simulating fish autonomous swimming behaviours using deep reinforcement learning based on Kolmogorov-Arnold Networks.

Bioinspiration & biomimetics
The study of fish swimming behaviours and locomotion mechanisms holds significant scientific and engineering value. With the rapid advancements in artificial intelligence, a new method combining deep reinforcement learning (DRL) with computational fl...

Novel deep reinforcement learning based collision avoidance approach for path planning of robots in unknown environment.

PloS one
Reinforcement learning is a remarkable aspect of the artificial intelligence field with many applications. Reinforcement learning facilitates learning new tasks based on action and reward principles. Motion planning addresses the navigation problem f...

Explainable post hoc portfolio management financial policy of a Deep Reinforcement Learning agent.

PloS one
Financial portfolio management investment policies computed quantitatively by modern portfolio theory techniques like the Markowitz model rely on a set of assumptions that are not supported by data in high volatility markets such as the technological...

Dynamic planning in hierarchical active inference.

Neural networks : the official journal of the International Neural Network Society
By dynamic planning, we refer to the ability of the human brain to infer and impose motor trajectories related to cognitive decisions. A recent paradigm, active inference, brings fundamental insights into the adaptation of biological organisms, const...

VCSAP: Online reinforcement learning exploration method based on visitation count of state-action pairs.

Neural networks : the official journal of the International Neural Network Society
In the domain of online reinforcement learning, strategies that leverage inherent rewards for exploration tend to achieve commendable outcomes within contexts characterized by deceptive or sparse rewards. Counting through the visitation of states is ...