Behaviors Coordination Using Restless Bandits Allocation Indexes

Yassine Faihe, Jean‐Pierre Müller

发表年份: 1998
引用次数: 8

摘要

In order to remain viable and to reproduce an animal has to continuously deal with the problem of choosing the right behavior among several others (e.g. obtaining food, obtaining water, avoiding predators, . . . ) at the right time. In robotics this problem arises when we want to synthesize a complex behavior from elementary behaviors. Within the reinforcement learning framework we review the behaviors coordination methods proposed so far. Then we discuss their limitations and propose a new coordination method based on the restless bandits theory. Restless bandits allocation indexes are an extension of the Gittins indexes and are borrowed from the field of optimal scheduling. They concern problems involving the sharing of limited resources between several projects which are being pursued. The performance of the proposed method is illustrated through the postman robot problem and compared to the Hierarchical Q-learning (Lin, 1993). 1. Introduction In order to remain viable and to repr...

关键词

Computer scienceOperations researchPsychologyArtificial intelligenceMathematics

Behaviors Coordination Using Restless Bandits Allocation Indexes

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control