Trust region
Related papers: 20
Top Cited Papers
Trust Region Policy Optimization
John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel
Citations: 3141 • 2015
Guided Policy Search via Approximate Mirror Descent
William Montgomery, Sergey Levine
Citations: 83 • 2016
An incremental trust-region method for robust online sparse least-squares estimation
David M. Rosen, Michael Kaess, John J. Leonard
Citations: 73 • 2012
A hybrid conjugate gradient based approach for solving unconstrained optimization and motion control problems
Auwal Bala Abubakar, Poom Kumam, Maulana Malik, Abdulkarim Hassan Ibrahim
Citations: 51 • 2021
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Kaiqing Zhang, Alec Koppel, Hao Zhu, Tamer Başar
Citations: 44 • 2019
A new trust region–sequential quadratic programming approach for nonlinear systems based on nonlinear model predictive control
Zhongbo Sun, Yifang Sun, Y. Li, K.P. Liu
Citations: 41 • 2018
Optimizing Expectations: From Deep Reinforcement Learning to Stochastic Computation Graphs
John Schulman
Citations: 37 • 2016
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim, Songhwai Oh
Citations: 21 • 2022
Reinforcement Learning for UAV Attitude Control
William R. Koch, Renato Mancuso, Richard West, Azer Bestavros
Citations: 19 • 2019
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk
Dohyeong Kim, Songhwai Oh
Citations: 17 • 2022
A Differentiable Augmented Lagrangian Method for Bilevel Nonlinear Optimization
Benoit Landry, Zachary Manchester, Marco Pavone
Citations: 16 • 2019
Guided Policy Search as Approximate Mirror Descent
William Montgomery, Sergey Levine
Citations: 15 • 2016
Stochastic Variance Reduction for Policy Gradient Estimation
Tian-Bing Xu, Qiang Liu, Jian Peng
Citations: 10 • 2017
Trust dampening and trust promoting: A dual-pathway of trust calibration in human-robot interaction
Xinyu Huang, Ye Li
Citations: 7 • 2024
Two steps natural actor critic learning for underwater cable tracking
Andrés El-Fakdi, Marc Carreras, Enric Galceran
Citations: 7 • 2010
Bayesian Optimization Based Trust Model for Human Multi-robot Collaborative Motion Tasks in Offroad Environments
Huanfei Zheng, Jonathon M. Smereka, Dariusz Mikulski, Yue Wang
Citations: 6 • 2023
Smoothing Policies and Safe Policy Gradients
Matteo Papini, Matteo Pirotta, Marcello Restelli
Citations: 6 • 2019
Hindsight Trust Region Policy Optimization
Hanbo Zhang, Xuguang Lan, David Hsu, Nanning Zheng
Citations: 5 • 2021
A Fast and Robust Algorithm for General Inequality/Equality Constrained Minimum-Time Problems
B.J. Driessen, Nader Sadegh, Gordon G. Parker, G. Richard Eisler
Citations: 5 • 1999
Deep Black-Box Reinforcement Learning with Movement Primitives
Fabian Otto, Onur Çelik, Hongyi Zhou, Hanna Ziesche, Ngo Anh Vien, Gerhard Neumann
Citations: 5 • 2022