
Reinforcement Learning for Robot Navigation in Nondeterministic Environments

Xiaoyun Liu, Qingrui Zhou, Hailin Ren, Changhao Sun

Publication year
2018
Citations
9

Abstract

Mobile robots are commonly used for missions such as target search and security surveillance in unknown environments, where an exact mathematical model may not be available. In this paper, we formulate mobile robot path planning in unknown environments as a nondeterministic Markov Decision Process (MDP) and provide a model-free reinforcement learning solution: a modified Q-learning algorithm that combines ε-greedy and Boltzmann exploration. We validate the proposed algorithm in simulation and compare its learning process with that of the original Q-learning algorithm. We also analyze the effect of the discount factor on the learning results. Simulations show that, given an appropriate value of the discount factor, the proposed algorithm generates the shortest path, which maximizes the accumulated reward, in environments with the nondeterministic Markov property.
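The abstract does not say how the two exploration rules are combined, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes that with probability ε the agent samples an action from a Boltzmann (softmax) distribution over its Q-values and otherwise acts greedily. The 4x4 grid, the slip probability, the rewards, and all function names are hypothetical and chosen purely for illustration.

```python
import math
import random

# Hypothetical 4x4 grid world; sizes, rewards, and probabilities are assumptions.
SIZE = 4
GOAL = (3, 3)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
SLIP = 0.1  # chance that a random action replaces the chosen one


def move(state, action):
    """Apply an action deterministically, clamping to the grid borders."""
    dr, dc = ACTIONS[action]
    return (min(max(state[0] + dr, 0), SIZE - 1),
            min(max(state[1] + dc, 0), SIZE - 1))


def step(state, action, rng):
    """Nondeterministic transition: with probability SLIP the action is randomized."""
    if rng.random() < SLIP:
        action = rng.randrange(len(ACTIONS))
    nxt = move(state, action)
    reward = 10.0 if nxt == GOAL else -1.0
    return nxt, reward, nxt == GOAL


def boltzmann_probs(q_values, temperature):
    """Softmax over Q-values; subtracting the max keeps exp() from overflowing."""
    m = max(q_values)
    weights = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def choose_action(q_values, epsilon, temperature, rng):
    """Assumed combination: Boltzmann sampling with probability epsilon, else greedy."""
    if rng.random() < epsilon:
        probs = boltzmann_probs(q_values, temperature)
        return rng.choices(range(len(q_values)), weights=probs)[0]
    return max(range(len(q_values)), key=lambda a: q_values[a])


def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.2, temperature=1.0, seed=0):
    """Tabular Q-learning; gamma is the discount factor discussed in the abstract."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * len(ACTIONS) for r in range(SIZE) for c in range(SIZE)}
    for _ in range(episodes):
        state = (0, 0)
        for _ in range(200):  # cap on episode length
            a = choose_action(q[state], epsilon, temperature, rng)
            nxt, reward, done = step(state, a, rng)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the greedy policy, assuming every intended move succeeds (no slip)."""
    state, path = (0, 0), [(0, 0)]
    while state != GOAL and len(path) <= max_steps:
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state = move(state, a)
        path.append(state)
    return path
```

Calling `greedy_path(train())` returns the path the learned greedy policy follows from the start cell. Rerunning `train` with different `gamma` values is one way to explore the discount-factor sensitivity the abstract mentions, though this toy setup is not meant to reproduce the paper's simulations.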

Keywords

Markov decision process; Reinforcement learning; Nondeterministic algorithm; Mobile robot; Computer science; Q-learning; Robot; Motion planning; Markov process; Artificial intelligence
