Trust region

Related papers: 20

Top Cited Papers

Trust Region Policy Optimization

John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel

Citations: 3141 • 2015

Guided Policy Search via Approximate Mirror Descent

William Montgomery, Sergey Levine

Citations: 83 • 2016

An incremental trust-region method for robust online sparse least-squares estimation

David M. Rosen, Michael Kaess, John J. Leonard

Citations: 73 • 2012

A hybrid conjugate gradient based approach for solving unconstrained optimization and motion control problems

Auwal Bala Abubakar, Poom Kumam, Maulana Malik, Abdulkarim Hassan Ibrahim

Citations: 51 • 2021

Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies

Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Başar

Citations: 44 • 2019

A new trust region–sequential quadratic programming approach for nonlinear systems based on nonlinear model predictive control

Zhongbo Sun, Yifang Sun, Y. Li, K.P. Liu

Citations: 41 • 2018

Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs

John Schulman

Citations: 37 • 2016

TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning

Dohyeong Kim, Songhwai Oh

Citations: 21 • 2022

Reinforcement Learning for UAV Attitude Control

William R. Koch, Renato Mancuso, Richard West, Azer Bestavros

Citations: 19 • 2019

Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk

Dohyeong Kim, Songhwai Oh

Citations: 17 • 2022

A Differentiable Augmented Lagrangian Method for Bilevel Nonlinear Optimization

Benoit Landry, Zachary Manchester, Marco Pavone

Citations: 16 • 2019

Guided Policy Search as Approximate Mirror Descent

William Montgomery, Sergey Levine

Citations: 15 • 2016

Stochastic Variance Reduction for Policy Gradient Estimation

Tian-Bing Xu, Qiang Liu, Jian Peng

Citations: 10 • 2017

Trust dampening and trust promoting: A dual-pathway of trust calibration in human-robot interaction

Xinyu Huang, Ye Li

Citations: 7 • 2024

Two steps natural actor critic learning for underwater cable tracking

Andrés El-Fakdi, Marc Carreras, Enric Galceran

Citations: 7 • 2010

Bayesian Optimization Based Trust Model for Human Multi-robot Collaborative Motion Tasks in Offroad Environments

Huanfei Zheng, Jonathon M. Smereka, Dariusz Mikulski, Yue Wang

Citations: 6 • 2023

Smoothing Policies and Safe Policy Gradients

Matteo Papini, Matteo Pirotta, Marcello Restelli

Citations: 6 • 2019

Hindsight Trust Region Policy Optimization

Hanbo Zhang, Xuguang Lan, David Hsu, Nanning Zheng

Citations: 5 • 2021

A Fast and Robust Algorithm for General Inequality/Equality Constrained Minimum-Time Problems

B.J. Driessen, Nader Sadegh, Gordon G. Parker, G. Richard Eisler

Citations: 5 • 1999

Deep Black-Box Reinforcement Learning with Movement Primitives

Fabian Otto, Onur Çelik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann

Citations: 5 • 2022