首页 /研究 /ALARM: Safe Reinforcement Learning With Reliable Mimicry for Robust Legged Locomotion
LOCOMOTION

ALARM: Safe Reinforcement Learning With Reliable Mimicry for Robust Legged Locomotion

Qiqi Zhou, Hui Ding, Teng Chen, Han Jiang, Guoteng Zhang, Bin Li, Xuewen Rong, Yibin Li

发表年份
2025
引用次数
5

摘要

Legged robots are supposed to traverse complicated environments, which makes it challenging to design a model-based controller due to their functional complexity. Currently, using deep reinforcement learning to improve the adaptability of robots in complex scenarios has been a major research trend. In this paper, we propose Adaptive Latent Aggregation for Reliable Mimicry (ALARM), a reinforcement learning framework that enables safe and robust locomotion in legged robots using only proprioception. This work features a one-step teacher-student training paradigm by constructing an adaptive aggregation strategy, which integrates the merits of imitation learning and reinforcement learning effectively. The framework integrates normalized penalized proximal policy optimization, which penalizes constraint-violating behaviors while optimizing locomotion policy. Our method facilitates efficient sim-to-real transfer, offering a promising approach for real-world legged robot applications.

关键词

MimicryReinforcement learningALARMComputer scienceReinforcementArtificial intelligencePsychologyBiologyEngineeringEcology

相关论文

查看 LOCOMOTION 分类全部论文