Deep Reinforcement Learning for Humanoid Robot Dribbling
Alexandre Muzio, Marcos R. O. A. Máximo, Takashi Yoneyama
- 发表年份
- 2020
- 引用次数
- 17
摘要
Humanoid robot soccer is a very traditional competitive task that aims to push the boundaries of state-of-the-art robotics. One of the many challenges of playing soccer is walking and running while not losing balance. Deep Reinforcement Learning (DRL) has been used to solve complex continuous control problems such as those in robotics. In this work, we focused on learning humanoid robot behavior to dribble a ball against a single opponent. Instead of learning how to control joint commands directly, we adopt an approach where the learning agent interacts with a predefined walking engine. Using DRL model-free algorithms (namely, Deep Deterministic Policy Gradients, Trust Region Policy Optimization, and Proximal Policy Optimization), we effectively learn a high level policy that allows a humanoid robot to fulfill this task. Finally, the learned dribble policy was evaluated on a simulated Nao robot from the RoboCup 3D Soccer Simulation League. According to our results, the learned agent was able to surpass the hand-coded behavior effectively used by the ITAndroids robotics team in the RoboCup competition.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002