Reinforcement learning with heuristic to solve POMDP problem in mobile robot path planning
Widyawardana Adiprawita, Adang Suwandi Ahmad, Jaka Sembiring, Bambang Riyanto Trilaksono
- 发表年份
- 2011
- 引用次数
- 5
摘要
In this paper we propose a method of presenting a special case of Value Function as a solution to POMDP in mobile robot navigation. By using this new method the Value Function complexity will be reduced and more intuitive. We also propose a new reinforcement learning method to solve the Value Function. This reinforcement learning is based on Bellman Equation augmented with A* like heuristic during update iteration. The result of this new Value Function is validated with This particle filter is simulaed in Matlab and also experimented physically using a simple autonomous mobile robot built with Lego Mindstorms NXT with 3 ultrasonic sonar and RWTH Mindstorms NXT Toolbox for Matlab to connect the robot to Matlab. This simulation and experiment also incorporate particle filter localization from previous research. The simulation and experiment show that the Value Function can be utilized very well.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002