LEARNING

Hierarchically Connecting Modularly-Learned Policies to Generate a Controller for a Combined Robot System

Sho Takeda, Satoshi Yamamori, Satoshi Yagi, Jun Morimoto

Year of publication: 2025
Citations: 1

Abstract

Deep reinforcement learning offers a promising approach for controlling robots with high degrees of freedom. However, its application is limited by substantial data requirements and difficulties in simulating complex physical interactions. This paper proposes a novel component-wise hierarchical policy learning approach that addresses these challenges by decomposing the robot into individual components and learning a separate policy for each. This strategy improves data efficiency and allows for more focused training of individual sub-systems. Upper-level policies then integrate these component policies through a separate learning process, enabling coordinated control of the robot's whole body. This hierarchical structure can be interpreted as a curriculum learning strategy, in which the robot gradually learns more complex tasks by first mastering individual component skills. By learning modular policies and then combining them, the approach offers improved generalization and robustness compared to monolithic policy learning. We validate the approach on a modular robot, demonstrating that this hierarchical, component-wise policy learning framework enables efficient control of complex robots.
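
The two-level structure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the class names `ComponentPolicy` and `UpperPolicy`, the function `hierarchical_step`, and the proportional control rule are all hypothetical stand-ins, and no training is shown. The sketch only assumes that each component policy sees its own slice of the observation plus a command from the upper level, and that the whole-body action is the concatenation of the component actions.

```python
from dataclasses import dataclass
from typing import List, Sequence

Vector = List[float]


@dataclass
class ComponentPolicy:
    """Hypothetical low-level policy for one robot component (e.g. one limb).

    Stands in for a policy learned separately for that component. Here it is
    a fixed proportional rule that pushes each joint toward its command.
    """
    gain: float

    def act(self, local_obs: Sequence[float], command: Sequence[float]) -> Vector:
        return [self.gain * (c - o) for o, c in zip(local_obs, command)]


@dataclass
class UpperPolicy:
    """Hypothetical upper-level policy.

    Stands in for a policy learned in a separate, later stage. It maps the
    whole-body observation and a goal to one command vector per component.
    This placeholder simply broadcasts the goal to every joint.
    """
    n_components: int
    joints_per_component: int

    def commands(self, global_obs: Sequence[float], goal: float) -> List[Vector]:
        return [[goal] * self.joints_per_component for _ in range(self.n_components)]


def hierarchical_step(
    upper: UpperPolicy,
    components: Sequence[ComponentPolicy],
    global_obs: Sequence[float],
    goal: float,
) -> Vector:
    """Compute one whole-body action by routing commands through component policies."""
    k = upper.joints_per_component
    cmds = upper.commands(global_obs, goal)
    action: Vector = []
    for i, (policy, cmd) in enumerate(zip(components, cmds)):
        local_obs = global_obs[i * k:(i + 1) * k]  # this component's slice
        action.extend(policy.act(local_obs, cmd))
    return action


if __name__ == "__main__":
    upper = UpperPolicy(n_components=2, joints_per_component=2)
    components = [ComponentPolicy(gain=0.5), ComponentPolicy(gain=2.0)]
    print(hierarchical_step(upper, components, [0.0, 1.0, 1.0, 3.0], goal=1.0))
```

In this arrangement the component policies can be kept fixed while only the upper level is adapted, which is one way to read the curriculum interpretation given in the abstract.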

Keywords

Modular design; Robustness (evolution); Robot; Reinforcement learning; Component (thermodynamics); Robot learning; Generalization
