首页 /研究 /Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

LEARNING

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

Xiaoyi He, Danggui Chen, Zhenshuo Zhang, Zimeng Bai

发表年份: 2025
访问权限: 开放获取

摘要

This paper presents a hierarchical path-planning and control framework that combines a high-level Deep Q-Network (DQN) for discrete sub-goal selection with a low-level Twin Delayed Deep Deterministic Policy Gradient (TD3) controller for continuous actuation. The high-level module selects behaviors and sub-goals; the low-level module executes smooth velocity commands. We design a practical reward shaping scheme (direction, distance, obstacle avoidance, action smoothness, collision penalty, time penalty, and progress), together with a LiDAR-based safety gate that prevents unsafe motions. The system is implemented in ROS + Gazebo (TurtleBot3) and evaluated with PathBench metrics, including success rate, collision rate, path efficiency, and re-planning efficiency, in dynamic and partially observable environments. Experiments show improved success rate and sample efficiency over single-algorithm baselines (DQN or TD3 alone) and rule-based planners, with better generalization to unseen obstacle configurations and reduced abrupt control changes. Code and evaluation scripts are available at the project repository.

关键词

cs.ROcs.AIcs.LG

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

摘要

关键词

相关论文

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare