LEARNING
A robot that reinforcement-leams to identify and memorize important previous observations
Boudewijn Bakker, Viktor Zhumatiy, G. Gruener, Jürgen Schmidhuber
- 发表年份
- 2004
- 引用次数
- 56
摘要
It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: (1) reinforcement learning with memory, implemented using an LSTM recurrent neural network whose inputs are discrete events extracted from raw inputs; (2) online exploration and offline policy learning. An experiment with a real robot demonstrates the methodology's feasibility.
关键词
Reinforcement learningObservabilityComputer scienceArtificial intelligenceRobotMemorizationMachine learningRobot learningArtificial neural networkReinforcement
相关论文
OTHER
📊 26,957 引用
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
PERCEPTION
📊 22,245 引用
Artificial intelligence: a modern approach
1995
OTHER
📊 18,993 引用
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
SWARM
📊 14,853 引用
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002