Efficient TD3 based path planning of mobile robot in dynamic environments using prioritized experience replay and LSTM.

Journal: Scientific reports
Published Date:

Abstract

To address the challenges of sample utilization efficiency and managing temporal dependencies, this paper proposes an efficient path planning method for mobile robot in dynamic environments based on an improved twin delayed deep deterministic policy gradient (TD3) algorithm. The proposed method, named PL-TD3, integrates prioritized experience replay (PER) and long short-term memory (LSTM) neural networks, which enhance both sample efficiency and the ability to handle time-series data. To verify the effectiveness of the proposed method, simulation and practical experiments were designed and conducted. In the simulation experiments, both static and dynamic obstacles were included in the test environment, along with experiments to assess generalization capabilities. The algorithm demonstrated superior performance in terms of both execution time and path efficiency. The practical experiments, based on the assumptions from the simulation tests, further confirmed that PL-TD3 has improved the effectiveness and robustness of path planning for mobile robot in dynamic environments.

Authors

  • Yunhan Lin
    School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, 430081, China.
  • Zhijie Zhang
    School of Instruments and Electronics, North University of China, Taiyuan, China.
  • Yijian Tan
    School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, 430081, China.
  • Hao Fu
    Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, P. R. China.
  • Huasong Min
    Department of Robotics and Intelligent Systems, Wuhan University of Science and Technology, Wuhan, China.

Keywords

No keywords available for this article.