Home /Research /Reinforcement Learning for Robot Navigation in Nondeterministic Environments
LEARNING

Reinforcement Learning for Robot Navigation in Nondeterministic Environments

Xiaoyun Liu, Qingrui Zhou, Hailin Ren, Changhao Sun

Year
2018
Citations
9

Abstract

Mobile robots are commonly used for missions like target searching and security surveillance in unknown environments, where an exact mathematical model may not be available. In this paper, we formulate the problem of mobile robot path planning in unknown environments as a nondeterministic Markov Decision Process (MDP), and provide a model-free reinforcement learning solution in which the modified Q-learning utilizes a combined ε-greedy and Boltzmann exploration. We simulate the validity of the proposed algorithm, and compare the learning process with that of the original Q-learning algorithm. We also analyze the effects of the discounted factor on learning results. Simulations show that the proposed algorithm can generate the shortest path that obtains the maximized accumulated reward in environments having nondeterministic Markovian property given appropriate values of the discounted factor.

Keywords

Markov decision processReinforcement learningNondeterministic algorithmMobile robotComputer scienceQ-learningRobotMotion planningMarkov processArtificial intelligence

Related papers

Browse all LEARNING papers