LEARNING

Learning a model-free robotic continuous state-action task through contractive Q-network

Mohammad-Javad Davari, Khalil Alipour, Alireza Hadi, Bahram Tarvirdizadeh

Publication year
2017
Citations
4

Abstract

In Reinforcement Learning (RL), working in high-dimensional continuous state-action spaces is challenging. Q-learning can be used for this purpose, with neural networks chosen as the Function Approximators (FAs) for the actor and the critic. Learning in this setting requires many experiments in a simulated environment. To reduce the number of these experiments, this research proposes a novel method, called contractive Q-network, for updating the critic FA (the Q-network). The efficiency of the method is demonstrated on two illustrative examples: first the well-known puddle world, and then a Push Recovery (PR) task on a simulated humanoid robot. The results show a 20% improvement in convergence speed.

Keywords

Reinforcement learning; Convergence (economics); Computer science; Task (project management); Context (archaeology); Humanoid robot; Artificial neural network; State (computer science); Artificial intelligence; Action (physics)
