Obstacle Avoidance through Reinforcement Learning
Tony J. Prescott, John E. W. Mayhew
- Year
- 1991
- Citations
- 24
- Access
- Open access
Abstract
A method is described for generating plan-like. reflexive. obstacle \navoidance behaviour in a mobile robot. The experiments reported here \nuse a simulated vehicle with a primitive range sensor. Avoidance \nbehaviour is encoded as a set of continuous functions of the perceptual \ninput space. These functions are stored using CMACs and trained by a \nvariant of Barto and Sutton's adaptive critic algorithm. As the vehicle \nexplores its surroundings it adapts its responses to sensory stimuli so \nas to minimise the negative reinforcement arising from collisions. \nStrategies for local navigation are therefore acquired in an explicitly \ngoal-driven fashion. The resulting trajectories form elegant collisionfree \npaths through the environment.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002