Obstacle Avoidance through Reinforcement Learning
Tony J. Prescott, John E. W. Mayhew
- 发表年份
- 1991
- 引用次数
- 24
- 访问权限
- 开放获取
摘要
A method is described for generating plan-like. reflexive. obstacle \navoidance behaviour in a mobile robot. The experiments reported here \nuse a simulated vehicle with a primitive range sensor. Avoidance \nbehaviour is encoded as a set of continuous functions of the perceptual \ninput space. These functions are stored using CMACs and trained by a \nvariant of Barto and Sutton's adaptive critic algorithm. As the vehicle \nexplores its surroundings it adapts its responses to sensory stimuli so \nas to minimise the negative reinforcement arising from collisions. \nStrategies for local navigation are therefore acquired in an explicitly \ngoal-driven fashion. The resulting trajectories form elegant collisionfree \npaths through the environment.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002