首页 /研究 /Learning to coordinate controllers-reinforcement learning on a control basis
LOCOMOTION

Learning to coordinate controllers-reinforcement learning on a control basis

Manfred Huber, Roderic A. Grupen

发表年份
1997
引用次数
33
访问权限
开放获取

摘要

Autonomous robot systems operating in an uncertain environment have to be reactive and adaptive in order to cope with changing environment conditions and task requirements. To achieve this, the hybrid control architecture presented in this paper uses reinforcement learning on top of a Discrete Event Dynamic System (DEDS) framework to learn to supervise a set of basis controllers in order to achieve a given task. The use of an abstract system model in the automatically derived supervisor reduces the complexity of the learning problem. In addition, safety constraints may be imposed a priori, such that the system learns on-line in a single trial without the need for an outside teacher. To demonstrate the applicability of the approach, the architecture is used to learn a turning gait on a four legged robot platform. 1 Introduction Autonomous robot systems operating in an uncertain environment have to be able to cope with new situations and task requirements. Important pr...

关键词

Reinforcement learningSupervisorComputer scienceRobotTask (project management)A priori and a posterioriRobot learningControl engineeringSet (abstract data type)Artificial intelligence

相关论文

查看 LOCOMOTION 分类全部论文