Encoding primitives generation policy learning for robotic arm to overcome catastrophic forgetting in sequential multi-tasks learning.

Journal: Neural networks : the official journal of the International Neural Network Society

Published Date: Jun 5, 2020

Abstract

Continual learning, a widespread ability in people and animals, aims to learn and acquire new knowledge and skills continuously. Catastrophic forgetting usually occurs in continual learning when an agent attempts to learn different tasks sequentially without storing or accessing previous task information. Unfortunately, current learning systems, e.g., neural networks, are prone to deviate the weights learned in previous tasks after training new tasks, leading to catastrophic forgetting, especially in a sequential multi-tasks scenario. To address this problem, in this paper, we propose to overcome catastrophic forgetting with the focus on learning a series of robotic tasks sequentially. Particularly, a novel hierarchical neural network's framework called Encoding Primitives Generation Policy Learning (E-PGPL) is developed to enable continual learning with two components. By employing a variational autoencoder to project the original state space into a meaningful low-dimensional feature space, representative state primitives could be sampled to help learn corresponding policies for different tasks. In learning a new task, the feature space is required to be close to the previous ones so that previously learned tasks can be protected. Extensive experiments on several simulated robotic tasks demonstrate our method's efficacy to learn control policies for handling sequentially arriving multi-tasks, delivering improvement substantially over some other continual learning methods, especially for the tasks with more diversity.

Authors

Fangzhou Xiong

State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Science, 100190, Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS), 100049, Beijing, China.
Zhiyong Liu

State Key Laboratory of Respiratory Disease , Guangzhou Institutes of Biomedicine and Health (GIBH) , Chinese Academy of Sciences (CAS) , Guangzhou-510530 , China . Email: zhang_tianyu@gibh.ac.cn ; ; Tel: (+86)20 3201 5270.
Kaizhu Huang

Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu, China. Electronic address: Kaizhu.Huang@xjtlu.edu.cn.
Xu Yang

Department of Food Science and Technology, The Ohio State University, Columbus, OH, United States.
Hong Qiao

State Key Lab of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of SciencesBeijing, China; Chinese Academy of Sciences Center for Excellence in Brain Science and Intelligence TechnologyShanghai, China; University of Chinese Academy of SciencesBeijing, China.
Amir Hussain

Cognitive Signal-Image and Control Processing Research Laboratory, School of Natural Sciences, University of Stirling, Stirling, FK9 4LA, United Kingdom.

Keywords

Arm Humans Machine Learning Robotics

External Resources

View on PubMed Access via DOI PubMed (32535306)

Encoding primitives generation policy learning for robotic arm to overcome catastrophic forgetting in sequential multi-tasks learning.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Encoding primitives generation policy learning for robotic arm to overcome catastrophic forgetting in sequential multi-tasks learning.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals