Obstacle Avoidance through Reinforcement Learning

Tony J. Prescott, John E. W. Mayhew

发表年份: 1991
引用次数: 24
访问权限: 开放获取

摘要

A method is described for generating plan-like. reflexive. obstacle\navoidance behaviour in a mobile robot. The experiments reported here\nuse a simulated vehicle with a primitive range sensor. Avoidance\nbehaviour is encoded as a set of continuous functions of the perceptual\ninput space. These functions are stored using CMACs and trained by a\nvariant of Barto and Sutton's adaptive critic algorithm. As the vehicle\nexplores its surroundings it adapts its responses to sensory stimuli so\nas to minimise the negative reinforcement arising from collisions.\nStrategies for local navigation are therefore acquired in an explicitly\ngoal-driven fashion. The resulting trajectories form elegant collisionfree\npaths through the environment.

关键词

Obstacle avoidanceCollision avoidanceReinforcement learningMobile robotComputer scienceObstacleArtificial intelligenceSet (abstract data type)RobotPerception

Obstacle Avoidance through Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory