Learning a model-free robotic continuous state-action task through contractive Q-network
Mohammad-Javad Davari, Khalil Alipour, Alireza Hadi, Bahram Tarvirdizadeh
- Publication year: 2017
- Citations: 4
Abstract
In Reinforcement Learning (RL), working in high-dimensional continuous state-action spaces is challenging. Q-learning can be used for this purpose, with neural networks chosen as the Function Approximators (FAs) for the actor and the critic. Learning in this setting requires many experiments in a simulated environment. To reduce the number of these experiments, this research proposes a novel method, called the contractive Q-network, for updating the critic FA (the Q-network). To show the efficiency of the developed method, two illustrative examples are presented: first the well-known puddle world, and then a Push Recovery (PR) task on a simulated humanoid robot. The results show a 20% improvement in the convergence speed of the method.