Learning Parameterized Skills

Bruno da Silva, George Konidaris, Andrew G. Barto

发表年份: 2012
引用次数: 71
访问权限: 开放获取

摘要

We introduce a method for constructing skills capable of solving tasks drawn from a distribution of parameterized reinforcement learning problems. The method draws example tasks from a distribution of interest and uses the corresponding learned policies to estimate the topology of the lower-dimensional piecewise-smooth manifold on which the skill policies lie. This manifold models how policy parameters change as task parameters vary. The method identifies the number of charts that compose the manifold and then applies non-linear regression in each chart to construct a parameterized skill by predicting policy parameters from task parameters. We evaluate our method on an underactuated simulated robotic arm tasked with learning to accurately throw darts at a parameterized target location.

关键词

Parameterized complexityTask (project management)Manifold (fluid mechanics)Computer sciencePiecewiseConstruct (python library)Piecewise linear functionReinforcement learningDistribution (mathematics)Mathematical optimization

Learning Parameterized Skills

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory