Efficient experience reuse in non-Markovian environments

Le Tien Dung, Takashi Komeda, Motoki Takagi

发表年份: 2008
引用次数: 6

摘要

Learning time is always a critical issue in Reinforcement Learning, especially when Recurrent Neural Networks are used to predict Q values in non-Markovian environments. Experience reuse has been received much attention due to its ability to reduce learning time. In this paper, we propose a new method to efficiently reuse experience. Our method generates new episodes from recorded episodes using an action-pair merger. Recorded episodes and new episodes are replayed after each learning epoch. We compare our method with standard online learning, and learning using experience replay in a vision based robot problem. The results show the potential of this approach.

关键词

ReuseComputer scienceReinforcement learningAction (physics)Artificial intelligenceMarkov decision processRobotMarkov processMachine learningEngineering

Efficient experience reuse in non-Markovian environments

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory