首页 /研究 /Behaviors Coordination Using Restless Bandits Allocation Indexes
LEARNING

Behaviors Coordination Using Restless Bandits Allocation Indexes

Yassine Faihe, Jean‐Pierre Müller

发表年份
1998
引用次数
8

摘要

In order to remain viable and to reproduce an animal has to continuously deal with the problem of choosing the right behavior among several others (e.g. obtaining food, obtaining water, avoiding predators, . . . ) at the right time. In robotics this problem arises when we want to synthesize a complex behavior from elementary behaviors. Within the reinforcement learning framework we review the behaviors coordination methods proposed so far. Then we discuss their limitations and propose a new coordination method based on the restless bandits theory. Restless bandits allocation indexes are an extension of the Gittins indexes and are borrowed from the field of optimal scheduling. They concern problems involving the sharing of limited resources between several projects which are being pursued. The performance of the proposed method is illustrated through the postman robot problem and compared to the Hierarchical Q-learning (Lin, 1993). 1. Introduction In order to remain viable and to repr...

关键词

Computer scienceOperations researchPsychologyArtificial intelligenceMathematics

相关论文

查看 LEARNING 分类全部论文