Model-based reinforcement learning with dimension reduction.

Journal: Neural networks : the official journal of the International Neural Network Society

Abstract

The goal of reinforcement learning is to learn an optimal policy that controls an agent so as to maximize the cumulative reward. The model-based reinforcement learning approach learns a transition model of the environment from data and then derives the optimal policy from the transition model. However, learning an accurate transition model in a high-dimensional environment requires a large amount of data, which is difficult to obtain. To overcome this difficulty, in this paper we propose to combine model-based reinforcement learning with the recently developed least-squares conditional entropy (LSCE) method, which simultaneously performs transition model estimation and dimension reduction. We further extend the proposed method to imitation learning scenarios. Experimental results show that policy search combined with LSCE performs well on high-dimensional control tasks, including real humanoid robot control.
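To make the pipeline concrete, here is a minimal sketch of model-based reinforcement learning with dimension reduction. It is not the paper's method: the LSCE estimator is replaced by a simple PCA projection and the transition model by ridge regression, purely for illustration, and the toy dynamics and helper names (`predict_next`, `greedy_action`) are hypothetical. The structure it follows is the one described in the abstract: project high-dimensional states onto a low-dimensional subspace, fit a transition model there, and search for actions using the learned model.

```python
# Sketch only: PCA + ridge regression stand in for the paper's joint
# LSCE-based estimation of the projection and the transition model.
import numpy as np

rng = np.random.default_rng(0)

# --- Collect transition data (s, a, s') from a toy high-dimensional system
# whose dynamics actually depend on only a 3-dimensional subspace.
dim_s, dim_a, n = 20, 2, 500
P_true = np.linalg.qr(rng.normal(size=(dim_s, 3)))[0]
S = rng.normal(size=(n, dim_s))
A = rng.normal(size=(n, dim_a))
S_next = ((S @ P_true) @ rng.normal(size=(3, dim_s))
          + A @ rng.normal(size=(dim_a, dim_s))
          + 0.01 * rng.normal(size=(n, dim_s)))

# --- Dimension reduction: project states onto the top-k principal directions
# (the paper instead learns this projection jointly with the model via LSCE).
k = 3
U = np.linalg.svd(S - S.mean(0), full_matrices=False)[2][:k].T  # (dim_s, k)
Z = S @ U                                                       # reduced states

# --- Transition model in the reduced space: ridge regression s' ~ f(z, a).
X = np.hstack([Z, A])
lam = 1e-3
theta = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ S_next)

def predict_next(s, a):
    """Predict the next state using the learned low-dimensional model."""
    return np.hstack([s @ U, a]) @ theta

# --- Model-based action selection: pick the candidate action whose predicted
# next state maximizes a known reward, here r(s) = -||s||^2 for illustration.
def greedy_action(s, candidates):
    scores = [-np.sum(predict_next(s, a) ** 2) for a in candidates]
    return candidates[int(np.argmax(scores))]

cands = [rng.normal(size=dim_a) for _ in range(32)]
print("chosen action:", greedy_action(rng.normal(size=dim_s), cands))
```

Because the regression is fit on k + dim_a inputs rather than dim_s + dim_a, far fewer transitions are needed for a given model accuracy, which is the motivation the abstract gives for combining dimension reduction with model learning.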

Authors

  • Voot Tangkaratt
Department of Computer Science, Tokyo Institute of Technology, Meguro-ku, Tokyo 152-8552, Japan. voot@sg.cs.titech.ac.jp
  • Jun Morimoto
    Dept. of Brain Robot Interface, ATR Computational Neuroscience Labs, Kyoto, Japan.
  • Masashi Sugiyama