A path planning method based on noisy D3QN algorithm with N-step updates

Lieping Zhang, Ming Zhang, Zhufei Leng, Yilin Wang

发表年份: 2025
引用次数: 3

摘要

To address the issues of unstable Q-value estimation and insufficient exploration during the early training stage of the Dueling Double Deep Q Network (D3QN), an N-step and Noisy D3QN algorithm is proposed. First, an N-step update strategy is designed, in which multi-step cumulative rewards replace single-step rewards. Second, learnable exploration noise is incorporated into the neural network so that each action selection of the mobile robot depends not only on the Q-value but also on stochastic perturbations, thereby enhancing exploration ability. Finally, ablation studies are conducted to quantify the incremental contributions of each component. Across multiple simulation environments with both static and dynamic obstacles, the proposed algorithm outperforms DDQN, D3QN, SAE-DDQN, and D3QN-PER in terms of average path length, average number of steps, and average traversal time. Furthermore, experiments in real-world environments verify the feasibility and robustness of the proposed method.

关键词

Tree traversalRobustness (evolution)Motion planningNoise (video)Artificial neural networkMobile robotPath (computing)

A path planning method based on noisy D3QN algorithm with N-step updates

摘要

关键词

相关论文

A new optimizer using particle swarm theory

Self-Organizing Maps

The Organization of Behavior

A robust layered control system for a mobile robot