首页 /研究 /Differential Dynamic Programming for time-delayed systems
LEARNING

Differential Dynamic Programming for time-delayed systems

发表年份
2016
引用次数
2

摘要

Trajectory optimization considers the problem of deciding how to control a dynamical system to move along a trajectory which minimizes some cost function. Differential Dynamic Programming (DDP) is an optimal control method which utilizes a second-order approximation of the problem to find the control. It is fast enough to allow real-time control and has been shown to work well for trajectory optimization in robotic systems. Here we extend classic DDP to systems with multiple time-delays in the state. Being able to find optimal trajectories for time-delayed systems with DDP opens up the possibility to use richer models for system identification and control, including recurrent neural networks with multiple timesteps in the state. We demonstrate the algorithm on a two-tank continuous stirred tank reactor. We also demonstrate the algorithm on a recurrent neural network trained to model an inverted pendulum with position information only.

关键词

Differential dynamic programmingTrajectoryDynamic programmingOptimal controlInverted pendulumTrajectory optimizationPosition (finance)Artificial neural networkDynamical systems theory

相关论文

查看 LEARNING 分类全部论文