Home /Research /Multi-head Fusion-based Actor-Critic Deep Reinforcement Learning with Memory Contextualisation for End-to-End Autonomous Navigation

LEARNING

Multi-head Fusion-based Actor-Critic Deep Reinforcement Learning with Memory Contextualisation for End-to-End Autonomous Navigation

Seunghyeop Nam, Tuấn Anh Nguyễn, Eunmi Choi, Dugki Min

Year: 2025
Citations: 2
Access: Open access

Abstract

Robust autonomous navigation in dynamic and cluttered environments is a persistent challenge for mobile robotics. Conventional methods, reliant on static maps and classical algorithms, often falter when faced with frequently reconfigured obstacle layouts. Deep Reinforcement Learning (DRL) enables adaptive, map-free navigation, yet the complexity of high-resolution LiDAR data and the presence of subtle, moving obstacles complicate training and scalability. To address these issues, we present the Multi-Head Memory Contextualising TD3 (mhmcTD3) architecture, a novel DRL framework that unites multiple specialised network heads-LiDAR fusion, LiDAR Feature, Robot States, and LiDAR Memory-within a unified actor-critic backbone. This integrated design emphasises critical LiDAR states through tailored preprocessing (inversion and exponential scaling), leverages CNNs for refined feature extraction, and employs an LSTM-based LiDAR Memory head for temporal awareness. Additionally, the incorporation of the SiLU activation function and the CoRE optimiser enhances learning stability and convergence efficiency. Extensive evaluations in both simulation (using GAZEBO with ROS2) and real-world tests on a Turtlebot3 waffle_pi platform demonstrate that mhmcTD3 delivers state-of-the-art performance under diverse LiDAR resolutions. It excels at detecting and avoiding small, dynamic obstacles while maintaining adaptability in dense, fast-changing scenarios. Ablation studies further confirm the indispensable synergy among the multiple heads. Taken together, these contributions establish mhmcTD3 as a robust and generalisable DRL-based navigation solution, bridging the gap from controlled simulations to complex real-world environments.

Keywords

End-to-end principleReinforcement learningComputer scienceHead (geology)Artificial intelligenceFusionReinforcementPsychologySocial psychologyPhilosophy

Multi-head Fusion-based Actor-Critic Deep Reinforcement Learning with Memory Contextualisation for End-to-End Autonomous Navigation

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory