首页 /研究 /Critic Only Policy Iteration-based Zero-sum Neuro-optimal Control of Modular and Reconfigurable Robots with uncertain disturbance via Adaptive Dynamic Programming

LEARNING

Critic Only Policy Iteration-based Zero-sum Neuro-optimal Control of Modular and Reconfigurable Robots with uncertain disturbance via Adaptive Dynamic Programming

Tianjiao An, Jingchen Chen, Xinye Zhu, Yuanchun Li, Keping Liu, Bo Dong

发表年份: 2020
引用次数: 4

摘要

A critic only policy iteration (COPI) scheme-based zero-sum neuro-optimal control method has been presented via adaptive dynamic programming (ADP) to address optimal trajectory velocity and tracking control of modular and reconfigurable robots (MRRs) problem. Based on policy iteration (PI) and ADP method, Hamilton-Jacobi-Issacs (HJI) equation is addressed by using only critic neural network (NN). The approximated optimal control can be obtained. Closed-loop system is proved to be asymptotic stable according to the Lyapunov theory. At last, simulations are demonstrated to show effectiveness of method.

关键词

Dynamic programmingOptimal controlControl theory (sociology)Modular designArtificial neural networkTrajectoryComputer scienceMathematical optimizationAdaptive controlLyapunov function

Critic Only Policy Iteration-based Zero-sum Neuro-optimal Control of Modular and Reconfigurable Robots with uncertain disturbance via Adaptive Dynamic Programming

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory