首页 /研究 /Efficient Exploration in Large State-Action Space Through Structured Action Space for Learning Multirobots Motion Planning
MANIPULATION

Efficient Exploration in Large State-Action Space Through Structured Action Space for Learning Multirobots Motion Planning

Chaoxu Mu, Zewu Jiang, Jiadong Zhang, Ke Wang, Xin Xu, Jun Yi

发表年份
2025
引用次数
2

摘要

Multirobots tasks, such as collaborative motion planning or flexible parts assembly, are complex for autonomous robot operations. The robot system for such tasks would need a combination of multigoal motion control and collision avoidance in the shared workspace. For reinforcement learning (RL)-based methods with enhanced flexibility and broader applicability, these methods struggle to effectively handle intricate tasks that require collision-free motion planning and control across long time horizons. Current RL systems cannot achieve efficiently motion planning due to high degrees-of-freedom (DoF), large-scale continuous action space, and tightly coupled workspaces of multirobot system. Additionally, the exploration efficiency for multigoal tasks is typically low, particularly in scenarios where rewards are scarce. To tackle these issues, we propose a new method to decompose the large-scale continuous action space into the combination of discrete action space that enhances exploration and small-scale continuous action space that enhances accuracy of structured hybrid action space (SHAS). To train the model with SHAS efficiently, we develop a task-conditioned hierarchical RL (TC-HRL) framework to train both high-level (HL) policy and low-level (LL) policy in parallel. Comparative experiments from different perspectives demonstrate that our model can learn multirobots manipulation tasks more efficiently and stably, and with considerably less time consumption. Finally, our work is verified by the pick-grasp tasks and real manipulators experiments.

关键词

WorkspaceReinforcement learningFlexibility (engineering)Action (physics)Motion (physics)Motion planningSpace (punctuation)RobotCollision avoidance

相关论文

查看 MANIPULATION 分类全部论文