Planning with an Adaptive World Model
Sebastian Thrun, Knut Möller, Alexander Linden
- Publication year: 1990
- Citations: 24
Abstract
We present a new connectionist planning method [TML90]. By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposes a plan which is subsequently optimized by gradient descent with a chain of world models, so that an optimal reinforcement may be obtained when it is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.
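As a rough illustration of the second planning step described in the abstract, the sketch below refines an initial plan by gradient ascent on the final reinforcement, backpropagating through a chain of identical world-model copies. Everything in it is an assumption made for illustration and not the authors' implementation: the world model is a fixed hand-written function rather than a learned network, an all-zero plan stands in for the experience network's proposal, and the names `world_model`, `world_model_grads`, `rollout`, `reinforcement`, and `optimize_plan` are hypothetical.

```python
import math


def world_model(x, a):
    """Hypothetical stand-in for a learned world model: predicts the next state."""
    return 0.9 * x + 0.5 * math.tanh(a)


def world_model_grads(x, a):
    """Partial derivatives of world_model with respect to state and action."""
    return 0.9, 0.5 * (1.0 - math.tanh(a) ** 2)


def rollout(x0, plan):
    """Chain the world model over the plan and return all visited states."""
    states = [x0]
    for a in plan:
        states.append(world_model(states[-1], a))
    return states


def reinforcement(x_final, goal):
    """Assumed reinforcement signal: higher when the final state is near the goal."""
    return -(x_final - goal) ** 2


def optimize_plan(x0, plan, goal, lr=0.5, steps=200):
    """Improve a proposed plan by gradient ascent through the model chain."""
    plan = list(plan)
    for _ in range(steps):
        states = rollout(x0, plan)
        # Gradient of the reinforcement with respect to the final state.
        grad_x = -2.0 * (states[-1] - goal)
        grads = [0.0] * len(plan)
        # Walk the chain backwards, one world-model copy per time step.
        for t in reversed(range(len(plan))):
            d_state, d_action = world_model_grads(states[t], plan[t])
            grads[t] = grad_x * d_action
            grad_x = grad_x * d_state
        # Ascent step: move every action toward higher predicted reinforcement.
        plan = [a + lr * g for a, g in zip(plan, grads)]
    return plan


if __name__ == "__main__":
    initial_plan = [0.0] * 5  # placeholder for an experience network's proposal
    refined = optimize_plan(0.0, initial_plan, goal=1.0)
    print("refined plan:", refined)
    print("predicted final state:", rollout(0.0, refined)[-1])
```

In the method the abstract describes, the world model would itself be trained by gradient descent from interaction with the environment, so the derivatives used here would come from that network and not from a closed-form function.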
Keywords
Reinforcement learning, Gradient descent, Computer science, Connectionism, Artificial intelligence, Task (project management), Robotics, Plan (archaeology), Machine learning, Mathematical optimization