Practical Reinforcement Learning in Continuous Spaces

William D. Smart, Leslie Pack Kaelbling

发表年份: 2000
引用次数: 211

摘要

Dynamic control tasks are good candidates for the application of reinforcement learning techniques. However, many of these tasks inherently have continuous state or action variables. This can cause problems for traditional reinforcement learning algorithms which assume discrete states and actions. In this paper, we introduce an algorithm that safely approximates the value function for continuous state control tasks, and that learns quickly from a small amount of data. We give experimental results using this algorithm to learn policies for both a simulated task and also for a real robot, operating in an unaltered environment. The algorithm works well in a traditional learning setting, and demonstrates extremely good learning when bootstrapped with a small amount of human-provided data. 1.

关键词

Reinforcement learningComputer scienceTask (project management)Artificial intelligenceBellman equationLearning classifier systemRobotState (computer science)Action (physics)Control (management)

Practical Reinforcement Learning in Continuous Spaces

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory