LEARNING

Combining the benefits of function approximation and trajectory optimization

Igor Mordatch, Emo Todorov

Year published: 2014
Citations: 96
Access: Open access

Abstract

Neural networks have recently solved many hard problems in Machine Learning, but their impact in control remains limited. Trajectory optimization has recently solved many hard problems in robotic control, but using it online remains challenging. Here we leverage the high-fidelity solutions obtained by trajectory optimization to speed up the training of neural network controllers. The two learning problems are coupled using the Alternating Direction Method of Multipliers (ADMM). This coupling enables the trajectory optimizer to act as a teacher, gradually guiding the network towards better solutions. We develop a new trajectory optimizer based on inverse contact dynamics, and provide not only the trajectories but also the feedback gains as training data to the network. The method is illustrated on rolling, reaching, swimming and walking tasks.
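The ADMM coupling described above can be illustrated with a deliberately tiny sketch. This is not the authors' trajectory optimizer or network: it assumes a scalar quadratic cost per sample standing in for the trajectory cost, and a one-parameter linear policy `theta * s` standing in for the neural network. All names (`admm_teacher_student`, `states`, `targets`, `rho`) are hypothetical. It shows only the alternation pattern: a "trajectory" step pulled toward the policy's output, a regression step fitting the policy to the trajectories, and a dual update that penalizes their disagreement.

```python
import random


def admm_teacher_student(states, targets, rho=1.0, iters=100):
    """Toy consensus ADMM between per-sample controls u_i and a linear policy theta * s_i.

    Assumed toy cost per sample: 0.5 * (u_i - t_i)^2, with constraint u_i = theta * s_i.
    """
    n = len(states)
    theta = 0.0
    lam = [0.0] * n  # scaled dual variables, one per sample
    u = [0.0] * n
    s_sq = sum(s * s for s in states)
    for _ in range(iters):
        # "Trajectory" step: closed-form minimizer of
        # 0.5 * (u - t)^2 + (rho / 2) * (u - theta * s + lam)^2 for each sample.
        u = [(t + rho * (theta * s - l)) / (1.0 + rho)
             for s, t, l in zip(states, targets, lam)]
        # "Policy" step: least-squares fit of theta * s to the shifted targets u + lam.
        theta = sum(s * (ui + l) for s, ui, l in zip(states, u, lam)) / s_sq
        # Dual step: accumulate the remaining disagreement between u and the policy.
        lam = [l + ui - theta * s for s, ui, l in zip(states, u, lam)]
    return theta, u


def least_squares_theta(states, targets):
    """Direct solution of the same toy problem, used only as a reference."""
    return sum(s * t for s, t in zip(states, targets)) / sum(s * s for s in states)


random.seed(0)
states = [random.uniform(-1.0, 1.0) for _ in range(50)]
targets = [2.0 * s + random.gauss(0.0, 0.1) for s in states]
theta, u = admm_teacher_student(states, targets)
```

In this convex toy case the alternation converges to the same `theta` as the direct least-squares fit, and the controls `u` end up agreeing with the policy. The setting in the abstract is nonconvex and far higher-dimensional, so the sketch conveys the structure of the updates rather than any guarantee.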

Keywords

Trajectory, Computer science, Function (biology), Trajectory optimization, Mathematical optimization, Mathematics, Physics
