A path planning method based on noisy D3QN algorithm with N-step updates
Lieping Zhang, Ming Zhang, Zhufei Leng, Yilin Wang
- 发表年份
- 2025
- 引用次数
- 3
摘要
To address the issues of unstable Q-value estimation and insufficient exploration during the early training stage of the Dueling Double Deep Q Network (D3QN), an N-step and Noisy D3QN algorithm is proposed. First, an N-step update strategy is designed, in which multi-step cumulative rewards replace single-step rewards. Second, learnable exploration noise is incorporated into the neural network so that each action selection of the mobile robot depends not only on the Q-value but also on stochastic perturbations, thereby enhancing exploration ability. Finally, ablation studies are conducted to quantify the incremental contributions of each component. Across multiple simulation environments with both static and dynamic obstacles, the proposed algorithm outperforms DDQN, D3QN, SAE-DDQN, and D3QN-PER in terms of average path length, average number of steps, and average traversal time. Furthermore, experiments in real-world environments verify the feasibility and robustness of the proposed method.
关键词
相关论文
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002
Self-Organizing Maps
Teuvo Kohonen
1995
The Organization of Behavior
D. O. Hebb
2005
A robust layered control system for a mobile robot
Rodney A. Brooks
1986