QMIX Algorithm for Coordinated Welding of Multiple Robots
Liangchuang Liao, Yang‐Yang Chen
- 发表年份
- 2021
- 引用次数
- 4
摘要
This paper considers the coordinated welding control based on deep multi-agent reinforcement learning. The discrete-time states and actions with local observation for the welding robots and the characteristics of the weld lines (e.g., the starting points of the weld lines and the ending points of the weld lines) are defined, which is suitable to use the monotonic value function factorisation for deep multi-agent reinforcement learning (QMIX) algorithm. A novel reward composed of trajectory optimization, collision avoidance and the task done is designed, which is proved by the simulation in the grid-world environment.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002