Priority-Based Reward Mechanism for Dual-Robot Path Planning in Dynamic Environments
Yibo Hu, Wei Li, Xiaoyu Guo, Zhenyao Li, Yanding Wei, Qiang Fang
- 发表年份
- 2025
- 引用次数
- 1
摘要
The definition of motion priority enables robot groups to handle competition and cooperation better when performing physical tasks. In this paper, we propose a priority-based step reward mechanism, which is a new reward mechanism for deep reinforcement learning of multi-robot systems and can improve collaboration between robotic arms in shared workspaces. The intention of each agent is provided to other agents, presenting a 2D map of the head that is visually consistent, with state and action representations aligned spatially. Guided by priority-based step rewards, the dual-robots are sequentially assigned high priority, enabling them to pass through complex environments in sequence. We validated our method on a path planning task where two robots collaborate to transport cubes in dynamic environments. The two robots need to consider obstacle avoidance while handing cubes to the human operator. The experimental environment includes Free space, obstacles in the middle, obstacles on both sides, and combinations of several environments. The results show that priority step rewards improve the performance of robot collaborative tasks and significantly enhance cooperative behavior.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002