首页 /研究 /Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics

MANIPULATION

Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics

Sergey Levine, Pieter Abbeel

发表年份: 2014
引用次数: 402

摘要

We present a policy search method that uses iteratively refitted local linear models to optimize trajectory distributions for large, continuous problems. These tra-jectory distributions can be used within the framework of guided policy search to learn policies with an arbitrary parameterization. Our method fits time-varying linear dynamics models to speed up learning, but does not rely on learning a global model, which can be difficult when the dynamics are complex and discontinuous. We show that this hybrid approach requires many fewer samples than model-free methods, and can handle complex, nonsmooth dynamics that can pose a challenge for model-based techniques. We present experiments showing that our method can be used to learn complex neural network policies that successfully execute simulated robotic manipulation tasks in partially observed environments with nu-merous contact discontinuities and underactuation. 1

关键词

TrajectoryComputer scienceArtificial neural networkArtificial intelligenceDynamics (music)Classification of discontinuitiesMachine learningVehicle dynamicsMathematicsEngineering

Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control