首页 /研究 /Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment

LEARNING

Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment

Li Shi, Yao Jinyi, Zhen Ye

发表年份: 2002
引用次数: 5

摘要

In order to achieve the competition tasks for multicooperating robots through learning, the paper discusses a kind of method that is designed for multi-agent systems (MAS), called the multi-reward fuzzy Q-learning algorithm (MRFQLA), which can be applied to the environment of the Robot World Cup Tournament (RoboCup). In MRFQLA., multiple reinforcement functions are established, based on the different characters of multi-agent systems. When the learning robot executes an action, these functions create multiple reinforcement signals that give the criteria of this action from different points of view. A Takagi-Sugeno (TS) model of a fuzzy inference system is built, which integrates these multiple rewards into one signal as the feedback of the learning robot. This method enhances the efficiency of learning because multiple rewards increase TD error and eliminates the conflict between the short-term target and the long-term one. Computer simulations in the RoboCup environment are shown and a discussion is given.

关键词

Reinforcement learningComputer scienceArtificial intelligenceRobotTournamentFuzzy logicAction (physics)Robot learningAction selectionTerm (time)

Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory