
LOCOMOTION

Learning to coordinate controllers-reinforcement learning on a control basis

Manfred Huber, Roderic A. Grupen

Publication year: 1997
Citations: 33
Access: Open access

Abstract

Autonomous robot systems operating in an uncertain environment have to be reactive and adaptive in order to cope with changing environmental conditions and task requirements. To achieve this, the hybrid control architecture presented in this paper uses reinforcement learning on top of a Discrete Event Dynamic System (DEDS) framework to learn to supervise a set of basis controllers in order to achieve a given task. The use of an abstract system model in the automatically derived supervisor reduces the complexity of the learning problem. In addition, safety constraints may be imposed a priori, such that the system learns on-line in a single trial without the need for an outside teacher. To demonstrate the applicability of the approach, the architecture is used to learn a turning gait on a four-legged robot platform.
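The idea of learning only among controllers that a supervisor permits can be illustrated with a small sketch. This is not the authors' implementation: it uses generic tabular Q-learning as a stand-in for the reinforcement learning component, runs over several episodes rather than a single on-line trial, and every name in it (`admissible`, `step`, the controller labels, the four-state toy task) is a hypothetical placeholder chosen for illustration.

```python
import random
from collections import defaultdict


def q_learning_with_supervisor(admissible, step, start, episodes=200,
                               max_steps=50, alpha=0.5, gamma=0.9,
                               epsilon=0.1, seed=0):
    """Tabular Q-learning that only ever tries supervisor-approved controllers.

    admissible(state) -> list of controllers allowed in that abstract state
    step(state, controller) -> (next_state, reward, done)
    Both callables are assumptions of this sketch, not part of the paper.
    """
    rng = random.Random(seed)
    q = defaultdict(float)  # maps (state, controller) to an estimated value
    for _ in range(episodes):
        state = start
        for _ in range(max_steps):
            actions = admissible(state)
            if not actions:
                break
            if rng.random() < epsilon:
                action = rng.choice(actions)  # explore, but only safe options
            else:
                action = max(actions, key=lambda a: q[(state, a)])
            next_state, reward, done = step(state, action)
            # Bootstrap only from controllers admissible in the next state.
            future = 0.0 if done else max(
                (q[(next_state, a)] for a in admissible(next_state)),
                default=0.0)
            q[(state, action)] += alpha * (
                reward + gamma * future - q[(state, action)])
            state = next_state
            if done:
                break
    return q


# Hypothetical toy task: abstract states 0..3, with state 3 as the goal.
ALL_CONTROLLERS = ["advance", "hold", "unsafe"]


def admissible(state):
    # Stand-in for a priori safety constraints: "unsafe" is never offered.
    return [c for c in ALL_CONTROLLERS if c != "unsafe"]


def step(state, controller):
    if controller == "advance":
        nxt = state + 1
        return nxt, (1.0 if nxt == 3 else 0.0), nxt == 3
    return state, 0.0, False  # "hold" leaves the abstract state unchanged


q = q_learning_with_supervisor(admissible, step, start=0)
policy = {s: max(admissible(s), key=lambda a: q[(s, a)]) for s in range(3)}
print(policy)
```

Because the learner draws its choices from `admissible(state)` both when acting and when bootstrapping, the excluded controller is never executed and never receives a value estimate, which is the sense in which constraints imposed beforehand shrink the learning problem in this sketch.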

Keywords

Reinforcement learning; Supervisor; Computer science; Robot; Task (project management); A priori and a posteriori; Robot learning; Control engineering; Set (abstract data type); Artificial intelligence
