Differential Dynamic Programming for time-delayed systems

Year of publication: 2016
Citations: 2

Abstract

Trajectory optimization considers the problem of deciding how to control a dynamical system so that it moves along a trajectory that minimizes some cost function. Differential Dynamic Programming (DDP) is an optimal control method that uses a second-order approximation of the problem to find the control. It is fast enough to allow real-time control and has been shown to work well for trajectory optimization in robotic systems. Here we extend classic DDP to systems with multiple time delays in the state. Being able to find optimal trajectories for time-delayed systems with DDP opens up the possibility of using richer models for system identification and control, including recurrent neural networks with multiple timesteps in the state. We demonstrate the algorithm on a two-tank continuous stirred tank reactor and on a recurrent neural network trained to model an inverted pendulum with position information only.

Keywords

Differential dynamic programming, Trajectory, Dynamic programming, Optimal control, Inverted pendulum, Trajectory optimization, Position (finance), Artificial neural network, Dynamical systems theory
