首页 /研究 /Distributed neural network-based policy gradient reinforcement learning for multi-robot formations
SWARM

Distributed neural network-based policy gradient reinforcement learning for multi-robot formations

Wen Shang, Dong Sun

发表年份
2008
引用次数
3

摘要

Multi-robot learning is a challenging task not only because of large and continuous state/action spaces, but also uncertainty and partial observability during learning. This paper presents a distributed policy gradient reinforcement learning (PGRL) methodology of a multi-robot system using neural network as the function approximator. This distributed PGRL algorithm enables each robot to independently decide its policy, which is, however, affected by all the other robots. Neural network is used to generalize over continuous state space as well as discrete/continuous action spaces. A case study on leader-follower formation application is performed to demonstrate the effectiveness of the proposed learning method.

关键词

ObservabilityReinforcement learningComputer scienceRobotArtificial neural networkArtificial intelligenceState spaceAction (physics)Robot learningFunction (biology)

相关论文

查看 SWARM 分类全部论文