Reducing the complexity of multiagent reinforcement learning

Andriy Burkov, Brahim Chaib-draa

发表年份: 2007
引用次数: 9

摘要

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment's states. It was shown, however, that the learning complexity for the goal-directed problems may be substantially reduced by initializing the Q-values with a "good" approximative function. In the multiagent case, there exists such a good approximation for a big class of problems, namely, for goal-directed stochastic games. These games, for example, can reflect coordination and common interest problems of cooperative robotics. The approximative function for these games is nothing but the relaxed, single-agent, problem solution, which can easily be found by each agent individually. In this article, we show that (1) an optimal single-agent solution is a "good" approximation for the goal-directed stochastic games with action-penalty representation and (b) the complexity is reduced when the learning is initialized with this approximative function, as compared to the uninformed case.

关键词

Reinforcement learningInitializationComputer scienceFunction (biology)Artificial intelligenceRepresentation (politics)Class (philosophy)Multi-agent systemFunction approximationMathematical optimization

Reducing the complexity of multiagent reinforcement learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory