Fast reinforcement learning using stochastic shortest paths for a mobile robot
Woo-Young Kwon, Il Hong Suh, Sanghoon Lee, Young-Jo Cho
- 发表年份
- 2007
- 引用次数
- 9
摘要
Abstract — Reinforcement learning (RL) has been used as a learning mechanism for a mobile robot to learn state-action relations without a priori knowledge of working environment. However, most RL methods usually suffer from slow conver-gence to learn optimum state-action sequence. In this paper, it is intended to improve a learning speed by compounding an existing Q-learning method with a shortest path finding algorithm. To integrate the shortest path algorithm with Q-learning method, a stochastic state-transition model is used to store a previous observed state, a previous action and a current state. Whenever a robot reaches a goal, a Stochastic Shortest Path(SSP) will be found from the stochastic state-transition model. State-action pairs on the SSP will be counted as more significant in the action selection. Using this learning method, the learning speed will be boosted when compared with classical RL methods. To show the validity of our proposed learning technology, several simulations and experimental results will be illustrated. I.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002