首页 /研究 /On the (almost) Global Exponential Convergence of the Overparameterized Policy Optimization for the LQR Problem

LEARNING

On the (almost) Global Exponential Convergence of the Overparameterized Policy Optimization for the LQR Problem

Moh Kamalul Wafi, Arthur Castello B. de Oliveira, Eduardo D. Sontag

发表年份: 2025
访问权限: 开放获取

摘要

In this work we study the convergence of gradient methods for nonconvex optimization problems -- specifically the effect of the problem formulation to the convergence behavior of the solution of a gradient flow. We show through a simple example that, surprisingly, the gradient flow solution can be exponentially or asymptotically convergent, depending on how the problem is formulated. We then deepen the analysis and show that a policy optimization strategy for the continuous-time linear quadratic regulator (LQR) (which is known to present only asymptotic convergence globally) presents almost global exponential convergence if the problem is overparameterized through a linear feed-forward neural network (LFFNN). We prove this qualitative improvement always happens for a simplified version of the LQR problem and derive explicit convergence rates for the gradient flow. Finally, we show that both the qualitative improvement and the quantitative rate gains persist in the general LQR through numerical simulations.

关键词

math.OCeess.SY

On the (almost) Global Exponential Convergence of the Overparameterized Policy Optimization for the LQR Problem

摘要

关键词

相关论文

The Organization of Behavior

Fractional Brownian Motions, Fractional Noises and Applications

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

A guide to deep learning in healthcare