Reinforcement Learning for Robot Navigation in Nondeterministic Environments
Xiaoyun Liu, Qingrui Zhou, Hailin Ren, Changhao Sun
- Year
- 2018
- Citations
- 9
Abstract
Mobile robots are commonly used for missions like target searching and security surveillance in unknown environments, where an exact mathematical model may not be available. In this paper, we formulate the problem of mobile robot path planning in unknown environments as a nondeterministic Markov Decision Process (MDP), and provide a model-free reinforcement learning solution in which the modified Q-learning utilizes a combined ε-greedy and Boltzmann exploration. We simulate the validity of the proposed algorithm, and compare the learning process with that of the original Q-learning algorithm. We also analyze the effects of the discounted factor on learning results. Simulations show that the proposed algorithm can generate the shortest path that obtains the maximized accumulated reward in environments having nondeterministic Markovian property given appropriate values of the discounted factor.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002