Planning with an Adaptive World Model

Sebastian Thrun, Knut Möller, Alexander Linden

Year of publication: 1990
Citations: 24

Abstract

We present a new connectionist planning method [TML90]. By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposes a plan which is subsequently optimized by gradient descent with a chain of world models, so that an optimal reinforcement may be obtained when it is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.
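The two stages described in the abstract can be illustrated with a deliberately tiny sketch. It is not the authors' implementation: the one-dimensional environment, the single-weight linear model standing in for the world-model network, the all-zero initial plan standing in for the experience network's proposal, and all function names are assumptions made for illustration only.

```python
import random

def true_env(s, a):
    # Hypothetical "unknown" environment; the learner never reads the 0.5 directly.
    return s + 0.5 * a

def fit_world_model(steps=2000, lr=0.05, seed=0):
    """Learn w in the model s' = s + w*a by gradient descent on squared prediction error."""
    rng = random.Random(seed)
    w = 0.0
    for _ in range(steps):
        s = rng.uniform(-1.0, 1.0)
        a = rng.uniform(-1.0, 1.0)
        err = (s + w * a) - true_env(s, a)  # model prediction minus observed next state
        w -= lr * 2.0 * err * a             # d(err^2)/dw = 2 * err * a
    return w

def rollout(step_gain, s0, plan):
    """Chain one-step transitions s' = s + step_gain*a over the whole plan."""
    s = s0
    for a in plan:
        s = s + step_gain * a
    return s

def optimize_plan(w, s0, plan, goal, lr=0.1, iters=200):
    """Improve a proposed plan by gradient ascent on r = -(s_T - goal)^2 through the model chain."""
    plan = list(plan)
    for _ in range(iters):
        s_T = rollout(w, s0, plan)
        # In this linear chain ds_T/da_t = w for every step t, so all actions share one gradient.
        grad_r = -2.0 * (s_T - goal) * w
        plan = [a + lr * grad_r for a in plan]
    return plan

if __name__ == "__main__":
    w = fit_world_model()
    initial_plan = [0.0, 0.0, 0.0]  # stand-in for the experience network's proposal
    plan = optimize_plan(w, s0=0.0, plan=initial_plan, goal=1.0)
    print("learned w:", w)
    print("final state in the true environment:", rollout(0.5, 0.0, plan))
```

In this toy setting the gradient through the chain is trivial because the model is linear; with a nonlinear world model the same loop would backpropagate through each copy of the model in turn.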

Keywords

Reinforcement learning, Gradient descent, Computer science, Connectionism, Artificial intelligence, Task (project management), Robotics, Plan (archaeology), Machine learning, Mathematical optimization
